Directly distill from Llama, then finetune with DPO
Junxiong Wang
JunxiongWang
AI & ML interests
Attention Free Model / Subquadratic Language Models
Recent Activity
published a model 8 days ago: JunxiongWang/Llama-3.2-1B-MATH
updated a model 8 days ago: JunxiongWang/Llama-3.2-1B-MATH
published a model 9 days ago: JunxiongWang/MambaInLlama1B_v4
Collections 7
Models 45
JunxiongWang/Llama-3.2-1B-MATH • Text Generation • Updated • 9
JunxiongWang/MambaInLlama1B_v4 • Updated • 6
JunxiongWang/MambaInLlama3B_v4 • Updated • 17
JunxiongWang/MambaInLlama3B_DPO2 • Updated • 18
JunxiongWang/MambaInLlama3B_DPO1 • Updated • 9
JunxiongWang/MambaInLlama3B_v3.1 • Updated • 189
JunxiongWang/MambaInLlama3B_v3 • Updated • 133
JunxiongWang/MambaInLlama1B_v3 • Updated • 144
JunxiongWang/mamba_0_5_distill • Updated • 4
JunxiongWang/Llama3.2-Mamba-3B-dpo • Updated • 26
Datasets 11
JunxiongWang/test_math • Viewer • Updated • 89.1k • 87
JunxiongWang/FineMathV4 • Viewer • Updated • 6.7M • 102
JunxiongWang/model_revision_max_4_closest_and_random • Viewer • Updated • 530k • 84
JunxiongWang/sftdatasetv4 • Viewer • Updated • 4.96M • 80
JunxiongWang/sftdatasetv3 • Viewer • Updated • 12.4M • 281
JunxiongWang/sftdatasetv2 • Viewer • Updated • 11.8M • 68
JunxiongWang/sftdataset • Viewer • Updated • 11M • 482 • 2
JunxiongWang/llama3-ultrafeedback-armorm • Viewer • Updated • 61.8k • 151 • 1
JunxiongWang/gemma2_sftdataset • Viewer • Updated • 11M • 41
JunxiongWang/largetestdataset • Viewer • Updated • 7.49M • 34