InfiR: Reasoning-Enhanced Low-Resource Training Pipeline Collection InfiR : Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning • 9 items • Updated about 17 hours ago
villa-X: Enhancing Latent Action Modeling in Vision-Language-Action Models Paper • 2507.23682 • Published 8 days ago • 22
Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference Paper • 2508.02193 • Published 5 days ago • 100 • 13
NeuralOS: Towards Simulating Operating Systems via Neural Generative Models Paper • 2507.08800 • Published 28 days ago • 77
Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination Paper • 2507.10532 • Published 25 days ago • 84
Skip a Layer or Loop it? Test-Time Depth Adaptation of Pretrained LLMs Paper • 2507.07996 • Published 29 days ago • 32
GPAS: Accelerating Convergence of LLM Pretraining via Gradient-Preserving Activation Scaling Paper • 2506.22049 • Published Jun 27 • 2 • 1
GUI-Reflection: Empowering Multimodal GUI Models with Self-Reflection Behavior Paper • 2506.08012 • Published Jun 9 • 7
BitVLA: 1-bit Vision-Language-Action Models for Robotics Manipulation Paper • 2506.07530 • Published Jun 9 • 20
Adaptive Classifier-Free Guidance via Dynamic Low-Confidence Masking Paper • 2505.20199 • Published May 26 • 2 • 2
LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning Paper • 2505.16933 • Published May 22 • 33
LaViDa: A Large Diffusion Language Model for Multimodal Understanding Paper • 2505.16839 • Published May 22 • 12