YeungNLP's picture

6 7 87

YeungNLP

YeungNLP

·

yangjianxin1

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

upvoted a paper about 1 month ago

QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

upvoted a paper about 1 month ago

QwenLong-CPRS: Towards infty-LLMs with Dynamic Context Optimization

View all activity

Organizations

upvoted 3 papers about 1 month ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 166

QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Paper • 2505.17667 • Published May 23 • 88

QwenLong-CPRS: Towards infty-LLMs with Dynamic Context Optimization

Paper • 2505.18092 • Published May 23 • 44

upvoted 2 papers 6 months ago

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Paper • 2501.11873 • Published Jan 21 • 66

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published Jan 2 • 53

upvoted a paper 7 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 368

upvoted an article about 1 year ago

Article

Faster fine-tuning using TRL & Unsloth

By

•

Jan 10, 2024

• 62