Vidhi Jain

vidhij
https://vidhijain.github.io
  • viddivj
  • vidhiJain

AI & ML interests

Language, Vision, Robotics

Organizations

CMU-LTI

authored 8 papers over 1 year ago

FlexCap: Generating Rich, Localized, and Flexible Captions in Images

Paper • 2403.12026 • Published Mar 18, 2024 • 2

Toward General-Purpose Robots via Foundation Models: A Survey and Meta-Analysis

Paper • 2312.08782 • Published Dec 14, 2023 • 6

Open X-Embodiment: Robotic Learning Datasets and RT-X Models

Paper • 2310.08864 • Published Oct 13, 2023 • 2

Transformers are Adaptable Task Planners

Paper • 2207.02442 • Published Jul 6, 2022

MAEA: Multimodal Attribution for Embodied AI

Paper • 2307.13850 • Published Jul 25, 2023

Spatial-Language Attention Policies for Efficient Robot Learning

Paper • 2304.11235 • Published Apr 21, 2023

Learning Embeddings that Capture Spatial Semantics for Indoor Navigation

Paper • 2108.00159 • Published Jul 31, 2021

Vid2Robot: End-to-end Video-conditioned Policy Learning with Cross-Attention Transformers

Paper • 2403.12943 • Published Mar 19, 2024 • 15
authored a paper about 2 years ago

HomeRobot: Open-Vocabulary Mobile Manipulation

Paper • 2306.11565 • Published Jun 20, 2023 • 16