PlaceIt3D: Language-Guided Object Placement in Real 3D Scenes Paper • 2505.05288 • Published 1 day ago • 5
Beyond the Last Answer: Your Reasoning Trace Uncovers More than You Think Paper • 2504.20708 • Published 11 days ago • 22
ZO2: Scalable Zeroth-Order Fine-Tuning for Extremely Large Language Models with Limited GPU Memory Paper • 2503.12668 • Published Mar 16 • 1
Mindstorms in Natural Language-Based Societies of Mind Paper • 2305.17066 • Published May 26, 2023 • 3
Learning to Identify Critical States for Reinforcement Learning from Videos Paper • 2308.07795 • Published Aug 15, 2023 • 7
4D-Bench: Benchmarking Multi-modal Large Language Models for 4D Object Understanding Paper • 2503.17827 • Published Mar 22 • 8
Towards Data-Efficient Pretraining for Atomic Property Prediction Paper • 2502.11085 • Published Feb 16 • 3
SmolTulu: Higher Learning Rate to Batch Size Ratios Can Lead to Better Reasoning in SLMs Paper • 2412.08347 • Published Dec 11, 2024 • 4
When Benchmarks are Targets: Revealing the Sensitivity of Large Language Model Leaderboards Paper • 2402.01781 • Published Feb 1, 2024 • 3
Fineweb-Edu-Ar: Machine-translated Corpus to Support Arabic Small Language Models Paper • 2411.06402 • Published Nov 10, 2024 • 2
SoccerNet-v2: A Dataset and Benchmarks for Holistic Understanding of Broadcast Soccer Videos Paper • 2011.13367 • Published Nov 26, 2020
SCTN: Sparse Convolution-Transformer Network for Scene Flow Estimation Paper • 2105.04447 • Published May 10, 2021 • 1
MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions Paper • 2112.00431 • Published Dec 1, 2021