Reflective Planning: Vision-Language Models for Multi-Stage Long-Horizon Robotic Manipulation Paper • 2502.16707 • Published 3 days ago • 10
LLM-Reasoning (training) Collection LLM reasoning papers, with RL and long COT. (Post)Training of LLM is involved. • 6 items • Updated 10 days ago
LLM-Reasoning (training) Collection LLM reasoning papers, with RL and long COT. (Post)Training of LLM is involved. • 6 items • Updated 10 days ago
LLM-Reasoning (training) Collection LLM reasoning papers, with RL and long COT. (Post)Training of LLM is involved. • 6 items • Updated 10 days ago
AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information? Paper • 2412.02611 • Published Dec 3, 2024 • 24
AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information? Paper • 2412.02611 • Published Dec 3, 2024 • 24
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning Paper • 2207.14800 • Published Jul 29, 2022
Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control Paper • 2307.00117 • Published Jun 30, 2023 • 6