- Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning (arXiv:2506.01939, Jun 2, 2025)
- Rationales Are Not Silver Bullets: Measuring the Impact of Rationales on Model Performance and Reliability (arXiv:2505.24147, May 30, 2025)
- Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models (arXiv:2506.05176, Jun 5, 2025)
- Rethinking Data Selection at Scale: Random Selection is Almost All You Need (arXiv:2410.09335, Oct 12, 2024)
- Language Models can Self-Lengthen to Generate Long Texts (arXiv:2410.23933, Oct 31, 2024)
- CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings (arXiv:2501.01257, Jan 2, 2025)
- InterBERT: Vision-and-Language Interaction for Multi-modal Pretraining (arXiv:2003.13198, Mar 30, 2020)
- ExpertPrompting: Instructing Large Language Models to be Distinguished Experts (arXiv:2305.14688, May 24, 2023)
- Chinese CLIP: Contrastive Vision-Language Pretraining in Chinese (arXiv:2211.01335, Nov 2, 2022)
- Transferring General Multimodal Pretrained Models to Text Recognition (arXiv:2212.09297, Dec 19, 2022)