yang

rasonyang

AI & ML interests

None yet

Recent Activity

liked a model 27 days ago

deepseek-ai/DeepSeek-R1-0528

liked a model about 2 months ago

deepseek-ai/DeepSeek-Prover-V2-671B

liked a model about 2 months ago

Qwen/Qwen3-30B-A3B

View all activity

Organizations

None yet

liked a model 27 days ago

deepseek-ai/DeepSeek-R1-0528

Text Generation • Updated 27 days ago • 155k • • 2.09k

liked 4 models about 2 months ago

liked 3 models 4 months ago

perplexity-ai/r1-1776

Text Generation • Updated Feb 26 • 12.1k • 2.28k

deepseek-ai/DeepSeek-V3

Text Generation • Updated Mar 27 • 2.04M • • 3.89k

deepseek-ai/DeepSeek-R1

Text Generation • Updated Mar 27 • 592k • • 12.4k

upvoted a paper 5 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 404

liked 2 models 9 months ago

superb/hubert-base-superb-ks

Audio Classification • Updated Nov 4, 2021 • 14.7k • 8

rasonyang/Llama-3.1-8B-Instruct-Surfer-Dude-Personality

Updated Sep 16, 2024 • 1

updated a model 9 months ago

rasonyang/Llama-3.1-8B-Instruct-Surfer-Dude-Personality

Updated Sep 16, 2024 • 1

updated a collection 11 months ago

papers

Collection

1 item • Updated Aug 2, 2024

upvoted a paper 11 months ago

The Llama 3 Herd of Models

Paper • 2407.21783 • Published Jul 31, 2024 • 117

liked a model 11 months ago

meta-llama/Llama-3.1-405B

Text Generation • Updated Sep 25, 2024 • 11.1k • 936

liked a model 12 months ago

shenzhi-wang/Gemma-2-9B-Chinese-Chat

Text Generation • Updated Jul 4, 2024 • 3.31k • 79

reacted to clem's post with 👍 about 1 year ago

Post

2924

Already almost 1,000 llama3 model variations have been shared publicly on HF (many more in private use at companies): https://huggingface.co/models?p=5&sort=trending&search=llama3.

Everyone should fine-tune their own models for their use-cases, languages, industry, infra constraints,...

10,000 llama3 variants by the end of next week?