ACECODER: Acing Coder RL via Automated Test-Case Synthesis Paper • 2502.01718 • Published 1 day ago • 13
VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models Paper • 2502.02492 • Published about 18 hours ago • 14
Can LLMs Maintain Fundamental Abilities under KV Cache Compression? Paper • 2502.01941 • Published 1 day ago • 6
Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search Paper • 2502.02508 • Published about 18 hours ago • 5
Generating Multi-Image Synthetic Data for Text-to-Image Customization Paper • 2502.01720 • Published 1 day ago • 2
The Jumping Reasoning Curve? Tracking the Evolution of Reasoning Performance in GPT-[n] and o-[n] Models on Multimodal Puzzles Paper • 2502.01081 • Published 2 days ago • 8
A Study on the Performance of U-Net Modifications in Retroperitoneal Tumor Segmentation Paper • 2502.00314 • Published 4 days ago • 3
SliderSpace: Decomposing the Visual Capabilities of Diffusion Models Paper • 2502.01639 • Published 1 day ago • 19
DeepRAG: Thinking to Retrieval Step by Step for Large Language Models Paper • 2502.01142 • Published 2 days ago • 10
PhD Knowledge Not Required: A Reasoning Challenge for Large Language Models Paper • 2502.01584 • Published 1 day ago • 6
Improved Training Technique for Latent Consistency Models Paper • 2502.01441 • Published 2 days ago • 7
ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning Paper • 2502.01100 • Published 2 days ago • 10
Improving Transformer World Models for Data-Efficient RL Paper • 2502.01591 • Published 1 day ago • 7
FastKV: KV Cache Compression for Fast Long-Context Processing with Token-Selective Propagation Paper • 2502.01068 • Published 2 days ago • 14
Preference Leakage: A Contamination Problem in LLM-as-a-judge Paper • 2502.01534 • Published 2 days ago • 25
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models Paper • 2502.01061 • Published 2 days ago • 115
The Differences Between Direct Alignment Algorithms are a Blur Paper • 2502.01237 • Published 2 days ago • 96