MPCHAT: Towards Multimodal Persona-Grounded Conversation Paper • 2305.17388 • Published May 27, 2023 • 1
Compositional Conservatism: A Transductive Approach in Offline Reinforcement Learning Paper • 2404.04682 • Published Apr 6, 2024
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass Paper • 2501.13928 • Published Jan 23 • 17
Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity of Language Use Paper • 2410.24218 • Published Oct 31, 2024 • 6
DANLI: Deliberative Agent for Following Natural Language Instructions Paper • 2210.12485 • Published Oct 22, 2022
What Gives the Answer Away? Question Answering Bias Analysis on Video QA Datasets Paper • 2007.03626 • Published Jul 7, 2020
3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less Hallucination Paper • 2406.05132 • Published Jun 7, 2024 • 31
LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an Agent Paper • 2309.12311 • Published Sep 21, 2023 • 17
An Investigation of Representation and Allocation Harms in Contrastive Learning Paper • 2310.01583 • Published Oct 2, 2023
Simple Disentanglement of Style and Content in Visual Representations Paper • 2302.09795 • Published Feb 20, 2023 • 1