Risa Tori's picture

Risa Tori

helloTR

AI & ML interests

None yet

Organizations

None yet

models 11

helloTR/llama2-7b-reverse-lora

helloTR/dpo-training-fixed

helloTR/llama3-dpo-pairrm-iter2

helloTR/iterative-dpo-pairrm

helloTR/sft-llama3.2-1b-lora

Updated Apr 21 • 3

helloTR/llama2-7b-dpo-final

helloTR/llama2-7b-dpo-forward

Text Generation • 7B • Updated Apr 20 • 5

helloTR/llama3-dpo-judge

Updated Apr 19 • 3

helloTR/llama3-dpo-pairrm

Updated Apr 19 • 3

helloTR/llama2-7b-reverse-instruction

datasets 5

helloTR/filtered-dpo-contrast-sample

Viewer • Updated Apr 21 • 500 • 4

helloTR/filtered-high-quality-dpo

Viewer • Updated Apr 20 • 10 • 17

helloTR/pairrm-preference-dataset

Viewer • Updated Apr 19 • 50 • 9

helloTR/reverse-seed-data

Viewer • Updated Apr 15 • 100 • 7

helloTR/dpo-contrast-sample

Viewer • Updated Apr 15 • 10 • 6