ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models Paper • 2311.07022 • Published Nov 13, 2023 • 1
Hippocrates: An Open-Source Framework for Advancing Large Language Models in Healthcare Paper • 2404.16621 • Published Apr 25, 2024
ReSpAct: Harmonizing Reasoning, Speaking, and Acting Towards Building Large Language Model-Based Conversational AI Agents Paper • 2411.00927 • Published Nov 1, 2024 • 1
ReSpAct: Harmonizing Reasoning, Speaking, and Acting Towards Building Large Language Model-Based Conversational AI Agents Paper • 2411.00927 • Published Nov 1, 2024 • 1
From Context to Action: Analysis of the Impact of State Representation and Context on the Generalization of Multi-Turn Web Navigation Agents Paper • 2410.23555 • Published Oct 31, 2024
Better Slow than Sorry: Introducing Positive Friction for Reliable Dialogue Systems Paper • 2501.17348 • Published Jan 28
Language Model is All You Need: Natural Language Understanding as Question Answering Paper • 2011.03023 • Published Nov 5, 2020
VISITRON: Visual Semantics-Aligned Interactively Trained Object-Navigator Paper • 2105.11589 • Published May 25, 2021
ReSpAct: Harmonizing Reasoning, Speaking, and Acting Towards Building Large Language Model-Based Conversational AI Agents Paper • 2411.00927 • Published Nov 1, 2024 • 1
Can a Single Model Master Both Multi-turn Conversations and Tool Use? CALM: A Unified Conversational Agentic Language Model Paper • 2502.08820 • Published Feb 12 • 5
Beyond Sample-Level Feedback: Using Reference-Level Feedback to Guide Data Synthesis Paper • 2502.04511 • Published Feb 6
LLMs are Vulnerable to Malicious Prompts Disguised as Scientific Language Paper • 2501.14073 • Published Jan 23
Learning to Explore and Select for Coverage-Conditioned Retrieval-Augmented Generation Paper • 2407.01158 • Published Jul 1, 2024