Article: Enhance Your Models in 5 Minutes with the Hugging Face Kernel Hub • By drbh and 6 others • 16 days ago • 101
FEAT: Full-Dimensional Efficient Attention Transformer for Medical Video Generation Paper • 2506.04956 • Published 23 days ago • 3
Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs Paper • 2504.07866 • Published Apr 10 • 11
Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper • 2505.03335 • Published May 6 • 175
RADLADS: Rapid Attention Distillation to Linear Attention Decoders at Scale Paper • 2505.03005 • Published May 5 • 34
MMLU-Pro Leaderboard 🥇 More advanced and challenging multi-task evaluation • Running on CPU Upgrade • 211