Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Risa Tori
helloTR
Follow
AI & ML interests
None yet
Organizations
None yet
models
11
Sort: Recently updated
helloTR/llama2-7b-reverse-lora
Updated
Apr 25
helloTR/dpo-training-fixed
Updated
Apr 22
helloTR/llama3-dpo-pairrm-iter2
Updated
Apr 22
helloTR/iterative-dpo-pairrm
Updated
Apr 22
helloTR/sft-llama3.2-1b-lora
Updated
Apr 21
•
3
helloTR/llama2-7b-dpo-final
Updated
Apr 20
helloTR/llama2-7b-dpo-forward
Text Generation
•
7B
•
Updated
Apr 20
•
5
helloTR/llama3-dpo-judge
Updated
Apr 19
•
3
helloTR/llama3-dpo-pairrm
Updated
Apr 19
•
3
helloTR/llama2-7b-reverse-instruction
Updated
Apr 1
View 11 models
datasets
5
Sort: Recently updated
helloTR/filtered-dpo-contrast-sample
Viewer
•
Updated
Apr 21
•
500
•
4
helloTR/filtered-high-quality-dpo
Viewer
•
Updated
Apr 20
•
10
•
17
helloTR/pairrm-preference-dataset
Viewer
•
Updated
Apr 19
•
50
•
9
helloTR/reverse-seed-data
Viewer
•
Updated
Apr 15
•
100
•
7
helloTR/dpo-contrast-sample
Viewer
•
Updated
Apr 15
•
10
•
6