YangWang92's picture

YangWang92

yangwang92

·

AI & ML interests

None yet

Recent Activity

liked a model about 15 hours ago

onecat-ai/OneCAT-3B

liked a model 1 day ago

mPLUG/GUI-Owl-32B

liked a model 1 day ago

HuggingFaceTB/SmolLM3-3B-checkpoints

View all activity

Organizations

upvoted a paper 1 day ago

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Paper • 2508.14444 • Published 21 days ago • 36

upvoted a paper 13 days ago

TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

Paper • 2508.17445 • Published 16 days ago • 78

upvoted a collection 14 days ago

NVIDIA Nemotron

Open, Production-ready Enterprise Models. Nvidia Open Model license. • 4 items • Updated 6 days ago • 56

upvoted a paper 23 days ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7 • 150

upvoted a collection 23 days ago

Recurrent Models

These are checkpoints for recurrent LLMs developed to scale test-time compute by recurring in latent space. • 15 items • Updated May 21 • 10

upvoted a collection 26 days ago

Web-SSL

17 items • Updated Apr 23 • 19

upvoted 2 collections about 1 month ago

Physics of Language Models: Part 4.2

16 items • Updated Jul 29 • 4

GLM-4.5

GLM-4.5: An open-source large language model designed for intelligent agents by Z.ai • 11 items • Updated 30 days ago • 230

upvoted 4 papers about 2 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 294

Step-Audio 2 Technical Report

Paper • 2507.16632 • Published Jul 22 • 69

Scaling Laws for Optimal Data Mixtures

Paper • 2507.09404 • Published Jul 12 • 35

Not All Correct Answers Are Equal: Why Your Distillation Source Matters

Paper • 2505.14464 • Published May 20 • 9

upvoted a collection about 2 months ago

AM-Distilled-Dataset

AM-Distilled-Dataset • 5 items • Updated Jun 5 • 3

upvoted a paper 2 months ago

PyVision: Agentic Vision with Dynamic Tooling

Paper • 2507.07998 • Published Jul 10 • 31

upvoted an article 2 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

By

and 22 others •

Jul 8

• 653

upvoted a paper 2 months ago

Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy

Paper • 2507.01352 • Published Jul 2 • 53

upvoted a collection 2 months ago

Skywork-Reward-V2

Scaling preference data curation to the extreme • 9 items • Updated Jul 4 • 23

upvoted 2 papers 2 months ago

Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning

Paper • 2507.00432 • Published Jul 1 • 77

Pangu Pro MoE: Mixture of Grouped Experts for Efficient Sparsity

Paper • 2505.21411 • Published May 27 • 17

upvoted an article 3 months ago

Article

Large-scale Near-deduplication Behind BigCode

By

•

May 16, 2023

• 34