siqi zhu's picture

Open to Work

3 13 7

siqi zhu

zsqzz

·

zhusq20

AI & ML interests

None yet

Organizations

upvoted 3 papers 2 months ago

Multi-Agent Evolve: LLM Self-Improve through Co-evolution

Paper • 2510.23595 • Published Oct 27, 2025 • 11

Efficient Long-context Language Model Training by Core Attention Disaggregation

Paper • 2510.18121 • Published Oct 20, 2025 • 122

Stronger Together: On-Policy Reinforcement Learning for Collaborative LLMs

Paper • 2510.11062 • Published Oct 13, 2025 • 28

upvoted a paper 3 months ago

GTAlign: Game-Theoretic Alignment of LLM Assistants for Mutual Welfare

Paper • 2510.08872 • Published Oct 10, 2025 • 3

upvoted a paper 5 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 316

upvoted a paper 6 months ago

Group-in-Group Policy Optimization for LLM Agent Training

Paper • 2505.10978 • Published May 16, 2025 • 18

upvoted a paper 10 months ago

Humanity's Last Exam

Paper • 2501.14249 • Published Jan 24, 2025 • 77

upvoted a paper 11 months ago

Fast Video Generation with Sliding Tile Attention

Paper • 2502.04507 • Published Feb 6, 2025 • 51

upvoted a paper about 1 year ago

Efficiently Serving LLM Reasoning Programs with Certaindex

Paper • 2412.20993 • Published Dec 30, 2024 • 36

upvoted a collection about 1 year ago

Synthetic Data and Self-Improvement

113 items • Updated Sep 26, 2025 • 9

upvoted 3 papers over 1 year ago

On the Diagram of Thought

Paper • 2409.10038 • Published Sep 16, 2024 • 13

LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

Paper • 2408.07055 • Published Aug 13, 2024 • 67

Efficient LLM Scheduling by Learning to Rank

Paper • 2408.15792 • Published Aug 28, 2024 • 20