Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
JunxiongWang
/
mamba_0_75_dpo_ep3
like
0
Text Generation
Transformers
PyTorch
HuggingFaceH4/ultrafeedback_binarized
mistral
alignment-handbook
Generated from Trainer
conversational
text-generation-inference
arxiv:
2408.15237
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
mamba_0_75_dpo_ep3
/
README.md
Commit History
Update README.md
533247d
verified
JunxiongWang
commited on
Sep 2, 2024
add models
7005c37
Junxiong Wang
commited on
Jul 19, 2024
initial commit
45743e3
verified
JunxiongWang
commited on
Jul 18, 2024