2 24 11

haoxintong

AI & ML interests

None yet

Recent Activity

upvoted a paper about 6 hours ago

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

upvoted a paper 5 days ago

Seedance 1.0: Exploring the Boundaries of Video Generation Models

upvoted a paper 7 days ago

Cartridges: Lightweight and general-purpose long context representations via self-study

View all activity

Organizations

haoxintong's activity

upvoted a paper about 6 hours ago

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published about 19 hours ago • 158

upvoted a paper 5 days ago

Seedance 1.0: Exploring the Boundaries of Video Generation Models

Paper • 2506.09113 • Published 7 days ago • 79

upvoted 2 papers 7 days ago

Cartridges: Lightweight and general-purpose long context representations via self-study

Paper • 2506.06266 • Published 11 days ago • 5

Reinforcement Pre-Training

Paper • 2506.08007 • Published 8 days ago • 208

upvoted a paper 12 days ago

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published 18 days ago • 124

upvoted a paper 14 days ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published 15 days ago • 154

upvoted a paper 20 days ago

Why Distillation can Outperform Zero-RL: The Role of Flexible Reasoning

Paper • 2505.21067 • Published 21 days ago • 3

upvoted 2 papers 21 days ago

Which Data Attributes Stimulate Math and Code Reasoning? An Investigation via Influence Functions

Paper • 2505.19949 • Published 22 days ago • 16

Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles

Paper • 2505.19914 • Published 22 days ago • 42

upvoted 2 papers 22 days ago

QwenLong-CPRS: Towards infty-LLMs with Dynamic Context Optimization

Paper • 2505.18092 • Published 25 days ago • 43

QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Paper • 2505.17667 • Published 25 days ago • 87

upvoted a paper 28 days ago

AdaCoT: Pareto-Optimal Adaptive Chain-of-Thought Triggering via Reinforcement Learning

Paper • 2505.11896 • Published May 17 • 57

upvoted 4 papers about 1 month ago

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published May 11 • 143

AttentionInfluence: Adopting Attention Head Influence for Weak-to-Strong Pretraining Data Selection

Paper • 2505.07293 • Published May 12 • 26

Nemotron-CC: Transforming Common Crawl into a Refined Long-Horizon Pretraining Dataset

Paper • 2412.02595 • Published Dec 3, 2024 • 5

NEMOTRON-CROSSTHINK: Scaling Self-Learning beyond Math Reasoning

Paper • 2504.13941 • Published Apr 15 • 11

upvoted a paper about 2 months ago

A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis

Paper • 2504.12322 • Published Apr 11 • 28

upvoted 2 papers 2 months ago

Seedream 3.0 Technical Report

Paper • 2504.11346 • Published Apr 15 • 64

Recitation over Reasoning: How Cutting-Edge Language Models Can Fail on Elementary School-Level Reasoning Problems?

Paper • 2504.00509 • Published Apr 1 • 22

upvoted a paper 3 months ago

How to Get Your LLM to Generate Challenging Problems for Evaluation

Paper • 2502.14678 • Published Feb 20 • 18