meta-llama/Llama-4-Maverick-17B-128E-Instruct • Image-Text-to-Text • Updated about 6 hours ago • 500 • 124
meta-llama/Llama-4-Scout-17B-16E-Instruct • Image-Text-to-Text • Updated about 6 hours ago • 7.06k • 238
LettuceDetect: A Hallucination Detection Framework for RAG Applications • Paper • 2502.17125 • Published Feb 24 • 10
Language Models can Self-Improve at State-Value Estimation for Better Search • Paper • 2503.02878 • Published Mar 4 • 9
Sim-to-Real Reinforcement Learning for Vision-Based Dexterous Manipulation on Humanoids • Paper • 2502.20396 • Published Feb 27 • 15
Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs • Paper • 2503.01307 • Published Mar 3 • 35
LADDER: Self-Improving LLMs Through Recursive Problem Decomposition • Paper • 2503.00735 • Published Mar 2 • 20