configint/SmolVLM2-256M-Video-Instruct-Action Image-Text-to-Text • 0.3B • Updated about 7 hours ago • 14
configint/SmolVLM2-256M-Video-Instruct-Action Image-Text-to-Text • 0.3B • Updated about 7 hours ago • 14
configint/SmolVLM2-500M-Video-Instruct-ActionTokens Image-Text-to-Text • 0.5B • Updated 8 days ago • 100 • 1
configint/SmolVLM2-500M-Video-Instruct-Action Image-Text-to-Text • 0.5B • Updated 8 days ago • 15 • 1
configint/SmolVLM2-500M-Video-Instruct-ActionTokens Image-Text-to-Text • 0.5B • Updated 8 days ago • 100 • 1
configint/SmolVLM2-500M-Video-Instruct-Action Image-Text-to-Text • 0.5B • Updated 8 days ago • 15 • 1
The Coverage Principle: A Framework for Understanding Compositional Generalization Paper • 2505.20278 • Published May 26 • 7
The CoT Encyclopedia: Analyzing, Predicting, and Controlling how a Reasoning Model will Think Paper • 2505.10185 • Published May 15 • 26
Cognitive Map for Language Models: Optimal Planning via Verbally Representing the World Model Paper • 2406.15275 • Published Jun 21, 2024 • 12
Cognitive Map for Language Models: Optimal Planning via Verbally Representing the World Model Paper • 2406.15275 • Published Jun 21, 2024 • 12
Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners Paper • 2210.02969 • Published Oct 6, 2022
The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning Paper • 2305.14045 • Published May 23, 2023 • 5
Exploring the Benefits of Training Expert Language Models over Instruction Tuning Paper • 2302.03202 • Published Feb 7, 2023 • 1
Self-Explore to Avoid the Pit: Improving the Reasoning Capabilities of Language Models with Fine-grained Rewards Paper • 2404.10346 • Published Apr 16, 2024 • 1
How Do Large Language Models Acquire Factual Knowledge During Pretraining? Paper • 2406.11813 • Published Jun 17, 2024 • 32
Gradient Ascent Post-training Enhances Language Model Generalization Paper • 2306.07052 • Published Jun 12, 2023
EHRSQL: A Practical Text-to-SQL Benchmark for Electronic Health Records Paper • 2301.07695 • Published Jan 16, 2023 • 1
Exploring the Benefits of Training Expert Language Models over Instruction Tuning Paper • 2302.03202 • Published Feb 7, 2023 • 1
The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning Paper • 2305.14045 • Published May 23, 2023 • 5