post-training's picture

post-training

post-train

·

https://pypi.org/user/post-training/

post-training

AI & ML interests

None yet

Recent Activity

liked a Space 24 days ago

ResembleAI/Chatterbox

upvoted a paper 24 days ago

HardTests: Synthesizing High-Quality Test Cases for LLM Coding

upvoted a paper 24 days ago

AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

View all activity

Organizations

upvoted 7 papers 24 days ago

HardTests: Synthesizing High-Quality Test Cases for LLM Coding

Paper • 2505.24098 • Published about 1 month ago • 44

AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

Paper • 2505.24863 • Published about 1 month ago • 95

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Paper • 2505.24726 • Published about 1 month ago • 225

Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis

Paper • 2506.04142 • Published 25 days ago • 27

Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning

Paper • 2506.04207 • Published 25 days ago • 45

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published 27 days ago • 165

MiMo-VL Technical Report

Paper • 2506.03569 • Published 26 days ago • 73

upvoted 7 papers 5 months ago

Great Models Think Alike and this Undermines AI Oversight

Paper • 2502.04313 • Published Feb 6 • 34

Fast Video Generation with Sliding Tile Attention

Paper • 2502.04507 • Published Feb 6 • 52

AuraFusion360: Augmented Unseen Region Alignment for Reference-based 360° Unbounded Scene Inpainting

Paper • 2502.05176 • Published Feb 7 • 38

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

Paper • 2502.06781 • Published Feb 10 • 61

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published Feb 10 • 154

Competitive Programming with Large Reasoning Models

Paper • 2502.06807 • Published Feb 3 • 70

TextAtlas5M: A Large-scale Dataset for Dense Text Image Generation

Paper • 2502.07870 • Published Feb 11 • 45