Vidhi Jain's picture

1

Vidhi Jain

vidhij

·

https://vidhijain.github.io

AI & ML interests

Language, Vision, Robotics

Organizations

authored 8 papers almost 2 years ago

FlexCap: Generating Rich, Localized, and Flexible Captions in Images

Paper • 2403.12026 • Published Mar 18, 2024 • 2

Toward General-Purpose Robots via Foundation Models: A Survey and Meta-Analysis

Paper • 2312.08782 • Published Dec 14, 2023 • 6

Open X-Embodiment: Robotic Learning Datasets and RT-X Models

Paper • 2310.08864 • Published Oct 13, 2023 • 2

Transformers are Adaptable Task Planners

Paper • 2207.02442 • Published Jul 6, 2022

MAEA: Multimodal Attribution for Embodied AI

Paper • 2307.13850 • Published Jul 25, 2023

Spatial-Language Attention Policies for Efficient Robot Learning

Paper • 2304.11235 • Published Apr 21, 2023

Learning Embeddings that Capture Spatial Semantics for Indoor Navigation

Paper • 2108.00159 • Published Jul 31, 2021

Vid2Robot: End-to-end Video-conditioned Policy Learning with Cross-Attention Transformers

Paper • 2403.12943 • Published Mar 19, 2024 • 15

authored a paper over 2 years ago

HomeRobot: Open-Vocabulary Mobile Manipulation

Paper • 2306.11565 • Published Jun 20, 2023 • 16