3 6 2

JiachengXu

XiaoBanni

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

upvoted a paper 2 months ago

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

published a Space 2 months ago

XiaoBanni/ultrascale-playbook

View all activity

Organizations

None yet

upvoted a paper about 2 months ago

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Paper • 2511.11793 • Published Nov 14, 2025 • 183

upvoted a paper 2 months ago

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

Paper • 2511.08892 • Published Nov 12, 2025 • 205

published a Space 2 months ago

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

updated a Space 2 months ago

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

New activity in nanotron/ultrascale-playbook 2 months ago

Clarification Needed: Description of Gradient Accumulation's Peak Memory Impact Seems Incorrect

👍 1

#122 opened 2 months ago by

XiaoBanni

liked a Space 2 months ago

The Ultra-Scale Playbook

🌌

3.65k

The ultimate guide to training LLM on large GPU Clusters

upvoted a paper 4 months ago

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Paper • 2509.02479 • Published Sep 2, 2025 • 83

upvoted a paper 6 months ago

MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization

Paper • 2507.14683 • Published Jul 19, 2025 • 134

upvoted a paper 7 months ago

Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy

Paper • 2507.01352 • Published Jul 2, 2025 • 56

updated a dataset 8 months ago

XiaoBanni/TACO_with_solution

Viewer • Updated Jun 1, 2025 • 9.1k • 9

published a dataset 8 months ago

XiaoBanni/TACO_with_solution

Viewer • Updated Jun 1, 2025 • 9.1k • 9

upvoted a paper 8 months ago

Skywork Open Reasoner 1 Technical Report

Paper • 2505.22312 • Published May 28, 2025 • 54

liked a model about 1 year ago

Skywork/Skywork-o1-Open-PRM-Qwen-2.5-7B

Text Classification • Updated Aug 29, 2025 • 728 • 51

New activity in meta-llama/Llama-3.1-8B-Instruct about 1 year ago

Can't reproduce MATH performance

#66 opened over 1 year ago by

jpiabrantes

JiachengXu

AI & ML interests

Recent Activity

Organizations

XiaoBanni's activity

The Ultra-Scale Playbook

The Ultra-Scale Playbook

Clarification Needed: Description of Gradient Accumulation's Peak Memory Impact Seems Incorrect

The Ultra-Scale Playbook

Can't reproduce MATH performance