Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Haitao999
/
Llama-3.2-3B-Instruct-EMPO-numia_prompt_dpo1
like
0
Text Generation
Transformers
Safetensors
RLHFlow/numia_prompt_dpo1
llama
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Llama-3.2-3B-Instruct-EMPO-numia_prompt_dpo1
Commit History
End of training
cb43f9b
verified
Haitao999
commited on
15 days ago
Model save
89040f1
verified
Haitao999
commited on
15 days ago
Training in progress, step 170
527b1e1
verified
Haitao999
commited on
15 days ago
Training in progress, step 160
e75cb1f
verified
Haitao999
commited on
15 days ago
Training in progress, step 130
a716624
verified
Haitao999
commited on
15 days ago
Training in progress, step 120
bf0f3de
verified
Haitao999
commited on
15 days ago
Training in progress, step 110
a5301ea
verified
Haitao999
commited on
16 days ago
Training in progress, step 100
e9de386
verified
Haitao999
commited on
16 days ago
Training in progress, step 90
6732a08
verified
Haitao999
commited on
16 days ago
Training in progress, step 60
7055657
verified
Haitao999
commited on
16 days ago
Training in progress, step 30
4d16880
verified
Haitao999
commited on
16 days ago
Training in progress, step 20
7374fab
verified
Haitao999
commited on
16 days ago
Training in progress, step 10
09dfca2
verified
Haitao999
commited on
16 days ago
End of training
fb96274
verified
Haitao999
commited on
16 days ago
Model save
521fc25
verified
Haitao999
commited on
16 days ago
End of training
6a67cfc
verified
Haitao999
commited on
16 days ago
Model save
9899458
verified
Haitao999
commited on
16 days ago
Training in progress, step 410
10b1498
verified
Haitao999
commited on
16 days ago
Training in progress, step 390
e3e8c5d
verified
Haitao999
commited on
17 days ago
Training in progress, step 370
d08bc9c
verified
Haitao999
commited on
17 days ago
Training in progress, step 350
d4668a2
verified
Haitao999
commited on
17 days ago
Training in progress, step 330
b32764a
verified
Haitao999
commited on
17 days ago
Training in progress, step 310
7d3ebcd
verified
Haitao999
commited on
17 days ago
Training in progress, step 290
df82bd6
verified
Haitao999
commited on
17 days ago
Training in progress, step 260
64e3d0f
verified
Haitao999
commited on
17 days ago
Training in progress, step 240
a8e6917
verified
Haitao999
commited on
17 days ago
Training in progress, step 220
13fdc83
verified
Haitao999
commited on
17 days ago
Training in progress, step 210
0ff25e4
verified
Haitao999
commited on
17 days ago
Training in progress, step 190
d61603a
verified
Haitao999
commited on
17 days ago
Training in progress, step 160
a5e0d45
verified
Haitao999
commited on
17 days ago
Training in progress, step 130
b473448
verified
Haitao999
commited on
17 days ago
Training in progress, step 110
284de38
verified
Haitao999
commited on
17 days ago
Training in progress, step 100
fd7c167
verified
Haitao999
commited on
17 days ago
Training in progress, step 80
39d6201
verified
Haitao999
commited on
17 days ago
Training in progress, step 60
cd35acb
verified
Haitao999
commited on
17 days ago
Training in progress, step 30
75b064a
verified
Haitao999
commited on
17 days ago
Training in progress, step 20
5b8f801
verified
Haitao999
commited on
17 days ago
Training in progress, step 10
91154a0
verified
Haitao999
commited on
17 days ago
initial commit
5522a39
verified
Haitao999
commited on
18 days ago