SmolVLM: Redefining small and efficient multimodal models Paper ⢠2504.05299 ⢠Published 13 days ago ⢠162
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper ⢠2502.02737 ⢠Published Feb 4 ⢠225
Towards Best Practices for Open Datasets for LLM Training Paper ⢠2501.08365 ⢠Published Jan 14 ⢠61
SelfCodeAlign: Self-Alignment for Code Generation Paper ⢠2410.24198 ⢠Published Oct 31, 2024 ⢠25