Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
11
1
Tongyao
PRO
tyzhu
Follow
0 followers
·
1 following
tongyao-zhu
AI & ML interests
Natural Language Processing
Recent Activity
updated
a dataset
about 1 hour ago
tyzhu/bio_n10_10k
published
a dataset
about 1 hour ago
tyzhu/bio_n10_10k
updated
a dataset
about 1 hour ago
tyzhu/bio_n1_10k
View all activity
Organizations
None yet
tyzhu
's models
430
Sort: Recently updated
tyzhu/nq_wikipedia_recite2-r1-ppo-llama3.2-3b-em-warmup-0.05-rouge-rouge9
Updated
Jun 6
tyzhu/nq_wikipedia_recite2-r1-ppo-llama3.2-3b-em-warmup-0.05-rouge-rouge7
Updated
Jun 6
tyzhu/nq_wikipedia_recite2-r1-ppo-llama3.2-3b-em-warmup-0.05-rouge-rouge5
Updated
Jun 6
tyzhu/nq_wikipedia_recite2-r1-ppo-llama3.2-3b-em-warmup-0.05-rouge-rouge3
Updated
Jun 6
tyzhu/nq_wikipedia_recite2-r1-grpo-qwen2.5-3b-it-em-warmup-0.05-rouge-rouge9
Updated
Jun 6
tyzhu/nq_wikipedia_recite2-r1-grpo-qwen2.5-3b-it-em-warmup-0.05-rouge-rouge7
Updated
Jun 6
tyzhu/nq_wikipedia_recite2-r1-grpo-qwen2.5-3b-it-em-warmup-0.05-rouge-rouge5
Updated
Jun 6
tyzhu/nq_wikipedia_recite2-r1-grpo-qwen2.5-3b-em-warmup-0.05-rouge-rouge9
Updated
Jun 6
tyzhu/nq_wikipedia_recite2-r1-grpo-qwen2.5-3b-em-warmup-0.05-rouge-rouge7
Updated
Jun 6
tyzhu/nq_wikipedia_recite2-r1-grpo-qwen2.5-3b-em-warmup-0.05-rouge-rouge5
Updated
Jun 6
tyzhu/nq_wikipedia_recite2-r1-grpo-llama3.2-3b-it-em-warmup-0.05-rouge-rouge9
Updated
Jun 6
tyzhu/nq_wikipedia_recite2-r1-grpo-llama3.2-3b-it-em-warmup-0.05-rouge-rouge7
Updated
Jun 6
tyzhu/nq_wikipedia_recite2-r1-grpo-llama3.2-3b-it-em-warmup-0.05-rouge-rouge5
Updated
Jun 6
tyzhu/nq_wikipedia_recite2-r1-grpo-llama3.2-3b-it-em-warmup-0.05-rouge-rouge3
Updated
Jun 6
tyzhu/nq_wikipedia_recite2-r1-grpo-llama3.2-3b-em-warmup-0.05-rouge-rouge9
Updated
Jun 6
tyzhu/nq_wikipedia_recite2-r1-grpo-llama3.2-3b-em-warmup-0.05-rouge-rouge7
Updated
Jun 6
tyzhu/nq_wikipedia_recite2-r1-ppo-llama3.2-3b-it-em-warmup-0.05-rouge-rouge8
Updated
Jun 4
tyzhu/nq_wikipedia_recite2-r1-ppo-llama3.2-3b-it-em-warmup-0.05-rouge-rouge6
Updated
Jun 4
tyzhu/temp_models_from_dataset
Updated
Jun 4
tyzhu/tiny_LLaMA_1b_8k_rb5_am_v13_mix4_8k
Updated
Jun 4
tyzhu/tiny_LLaMA_1b_8k_rb5_am_v13_mix3_8k
Updated
Jun 4
tyzhu/cft_output_models_
Updated
Jun 4
tyzhu/nq_wikipedia_recite2-r1-ppo-llama3.2-3b-it-em-warmup-0.05-rouge-rouge7
Updated
Jun 3
tyzhu/nq_wikipedia_recite2-r1-ppo-llama3.2-3b-it-em-warmup-0.05-rouge-rouge5
Updated
Jun 3
tyzhu/nq_wikipedia_recite2-r1-ppo-llama3.2-3b-it-em-warmup-0.05-rouge-rouge3
Updated
Jun 3
tyzhu/verl_checkpoints
Updated
Jun 2
tyzhu/nq_wikipedia_recite-r1-ppo-qwen2.5-3b-it-em-warmup-0.05-rouge-ro
Updated
Jun 1
tyzhu/nq_wikipedia_recite2-r1-ppo-llama3.2-3b-em-warmup-0.05-rouge-rou
Updated
Jun 1
tyzhu/nq_wikipedia_recite-r1-ppo-qwen2.5-3b-em-warmup-0.05-rouge-rouge
Updated
Jun 1
tyzhu/nq_wikipedia_recite2-r1-ppo-qwen2.5-3b-em-warmup-0.05-rouge-roug
Updated
Jun 1
Previous
1
2
3
4
5
...
15
Next