Tongyao PRO
tyzhu
AI & ML interests
Natural Language Processing
Recent Activity
published
a model
about 6 hours ago
tyzhu/deepscaler-2k-1.5b
upvoted
a
paper
about 6 hours ago
Understanding R1-Zero-Like Training: A Critical Perspective
updated
a model
about 18 hours ago
tyzhu/deepscaler-2k-1.5b
Organizations
None yet
Collections
8
models
245
tyzhu/deepscaler-2k-1.5b
Updated
tyzhu/tiny_LLaMA_1b_4k_proweb_4k
Updated
tyzhu/tiny_LLaMA_360M_8k_hybridsc_cc_8k
Updated
tyzhu/anchorcontext_3b_cont_models
Updated
tyzhu/tiny_LLaMA_120M_8k_hybrid_cc_8k
Updated
tyzhu/llama_7b_skyladder_decay
Updated
•
1
tyzhu/llama_7b_decay
Updated
•
1
tyzhu/code_tiny_LLaMA_1b_8k_code_8k_iter-200000-ckpt-step-100000_hf
Text Generation
•
Updated
•
198
tyzhu/llama_7b_500B
Updated
tyzhu/vlongva_checkpoints
Updated
datasets
827
tyzhu/anchorcontext_5M_v4_models
Updated
•
7
tyzhu/cc_subset
Viewer
•
Updated
•
10.6M
•
173
•
1
tyzhu/proweb_dec2
Updated
•
15
tyzhu/cc_merged_v2
Viewer
•
Updated
•
12.6M
•
150
tyzhu/fineweb-edu-sorted
Viewer
•
Updated
•
36.1M
•
125
tyzhu/arc_c_tr
Viewer
•
Updated
•
2.32k
•
87
tyzhu/arc_e_tr
Viewer
•
Updated
•
9.82k
•
69
tyzhu/hellaswag_tr
Viewer
•
Updated
•
17.5k
•
69
tyzhu/tpo
Viewer
•
Updated
•
269
•
42
tyzhu/quality
Viewer
•
Updated
•
173
•
60