Qi Liu (SJTU & SII)

purewhite42

Purewhite2019

AI & ML interests

Machine Learning, Formal Mathematics

CS PhD Student @ ReThinklab, SJTU and SII (an institution dedicated to innovation in education and research in the field of AI)

Recent Activity

upvoted a paper 20 days ago

Numina-Lean-Agent: An Open and General Agentic Reasoning System for Formal Mathematics

Paper • 2601.14027 • Published 25 days ago • 12

upvoted a paper about 1 month ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 225

upvoted 2 papers about 2 months ago

LLaDA2.0: Scaling Up Diffusion Language Models to 100B

Paper • 2512.15745 • Published Dec 10, 2025 • 86

Kling-Omni Technical Report

Paper • 2512.16776 • Published Dec 18, 2025 • 169

upvoted 2 papers 4 months ago

Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization

Paper • 2510.13554 • Published Oct 15, 2025 • 58

MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization

Paper • 2510.08540 • Published Oct 9, 2025 • 109

upvoted an article 6 months ago

Kimina-Prover-RL

Article • Aug 14, 2025

upvoted 4 papers 8 months ago

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published Jun 17, 2025 • 50

Mathesis: Towards Formal Theorem Proving from Natural Languages

Paper • 2506.07047 • Published Jun 8, 2025 • 6

Institutional Books 1.0: A 242B token dataset from Harvard Library's collections, refined for accuracy and usability

Paper • 2506.08300 • Published Jun 10, 2025 • 9

Give Me FP32 or Give Me Death? Challenges and Solutions for Reproducible Reasoning

Paper • 2506.09501 • Published Jun 11, 2025 • 19

upvoted 3 papers 9 months ago

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14, 2025 • 331

Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures

Paper • 2505.09343 • Published May 14, 2025 • 76

Beyond Theorem Proving: Formulation, Framework and Benchmark for Formal Problem-Solving

Paper • 2505.04528 • Published May 7, 2025 • 12

upvoted a collection 9 months ago

Formal Problem-Solving

Collection • This collection is part of the official implementation of Beyond Theorem Proving: Formulation, Framework and Benchmark for Formal Problem-Solving. • 5 items • Updated May 8, 2025 • 3

upvoted 2 papers 10 months ago

TrustGeoGen: Scalable and Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving

Paper • 2504.15780 • Published Apr 22, 2025 • 6

xVerify: Efficient Answer Verifier for Reasoning Model Evaluations

Paper • 2504.10481 • Published Apr 14, 2025 • 85

upvoted a collection 10 months ago

xVerify

Collection • 15 items • Updated Dec 3, 2025 • 13

upvoted 2 papers 12 months ago

Can Language Models Falsify? Evaluating Algorithmic Reasoning with Counterexample Creation

Paper • 2502.19414 • Published Feb 26, 2025 • 20

Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning

Paper • 2502.14768 • Published Feb 20, 2025 • 47
