100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models Paper β’ 2505.00551 β’ Published 29 days ago β’ 36
REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers Paper β’ 2504.10483 β’ Published Apr 14 β’ 21
Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model Paper β’ 2504.08685 β’ Published Apr 11 β’ 124
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis Paper β’ 2403.03206 β’ Published Mar 5, 2024 β’ 68
How far can we go with ImageNet for Text-to-Image generation? Paper β’ 2502.21318 β’ Published Feb 28 β’ 26
MLGym: A New Framework and Benchmark for Advancing AI Research Agents Paper β’ 2502.14499 β’ Published Feb 20 β’ 192
Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps Paper β’ 2501.09732 β’ Published Jan 16 β’ 72
Llama-3.1-Nemotron-70B Collection SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. β’ 6 items β’ Updated 10 days ago β’ 155
NVLM 1.0 Collection A family of frontier-class multimodal large language models (LLMs) that achieve state-of-the-art results on vision-language tasks and text-only tasks. β’ 2 items β’ Updated 10 days ago β’ 51
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences Paper β’ 2404.03715 β’ Published Apr 4, 2024 β’ 62
Awesome feedback datasets Collection A curated list of datasets with human or AI feedback. Useful for training reward models or applying techniques like DPO. β’ 19 items β’ Updated Apr 12, 2024 β’ 68
Awesome SFT datasets Collection A curated list of interesting datasets to fine-tune language models with. β’ 43 items β’ Updated Apr 12, 2024 β’ 133