DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 5 days ago • 209
Do generative video models learn physical principles from watching videos? Paper • 2501.09038 • Published 13 days ago • 30
O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning Paper • 2501.06458 • Published 16 days ago • 29
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing Paper • 2406.08464 • Published Jun 12, 2024 • 67
Are Vision-Language Models Truly Understanding Multi-vision Sensor? Paper • 2412.20750 • Published 28 days ago • 20
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated Dec 22, 2024 • 207
Med42v2 Dataset Collection Based on Table 5 in Appendix A of the original paper • 22 items • Updated Aug 14, 2024 • 2