ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning Paper • 2502.01100 • Published 2 days ago • 10
PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding Paper • 2501.16411 • Published 9 days ago • 17
DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation Paper • 2501.16764 • Published 8 days ago • 21
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published 8 days ago • 100
Denoising as Adaptation: Noise-Space Domain Adaptation for Image Restoration Paper • 2406.18516 • Published Jun 26, 2024 • 3
GeoPixel: Pixel Grounding Large Multimodal Model in Remote Sensing Paper • 2501.13925 • Published 13 days ago • 5
Chat With Janus-Pro-7B 🌍 A unified multimodal understanding and generation model. Running on Zero • 1.38k
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 14 days ago • 295
Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps Paper • 2501.09732 • Published 20 days ago • 67
Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks Paper • 2501.08326 • Published 22 days ago • 31
Diffusion Adversarial Post-Training for One-Step Video Generation Paper • 2501.08316 • Published 22 days ago • 32
MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published 22 days ago • 272