AmberYifan/llama3-8b-full-pretrain-control-tweet-1m-en-sft Text Generation • 8B • Updated 1 day ago • 15
AmberYifan/llama3-8b-full-pretrain-mix-low-tweet-1m-en-sft Text Generation • 8B • Updated 2 days ago • 14
AmberYifan/llama3-8b-full-pretrain-mix-mid-tweet-1m-en-sft Text Generation • 8B • Updated 2 days ago • 14
AmberYifan/llama3-8b-full-pretrain-mix-high-tweet-1m-en-sft Text Generation • 8B • Updated 2 days ago • 14
AmberYifan/Qwen2.5-7B-Instruct-userfeedback-on-policy-iter2 Text Generation • 8B • Updated 2 days ago • 14 • 1
AmberYifan/llama3-8b-full-pretrain-mix-low-tweet-1m-en-sft Text Generation • 8B • Updated 2 days ago • 14
AmberYifan/llama3-8b-full-pretrain-mix-mid-tweet-1m-en-sft Text Generation • 8B • Updated 2 days ago • 14
AmberYifan/llama3-8b-full-pretrain-mix-high-tweet-1m-en-sft Text Generation • 8B • Updated 2 days ago • 14
AmberYifan/llama3-8b-full-pretrain-junk-tweet-1m-en-sft Text Generation • 8B • Updated 2 days ago • 28
AmberYifan/Qwen2.5-7B-Instruct-userfeedback-on-policy-iter2 Text Generation • 8B • Updated 2 days ago • 14 • 1
AmberYifan/llama3-8b-full-pretrain-control-tweet-1m-en-sft Text Generation • 8B • Updated 1 day ago • 15
AmberYifan/Qwen2.5-7B-Instruct-userfeedback-on-policy-iter1 Text Generation • 8B • Updated 3 days ago • 32 • 1
AmberYifan/Qwen2.5-7B-Instruct-userfeedback-on-policy-iter1 Text Generation • 8B • Updated 3 days ago • 32 • 1
AmberYifan/Qwen2.5-7B-Instruct-userfeedback-4k-iter2 Text Generation • 8B • Updated 3 days ago • 15 • 1
AmberYifan/Qwen2.5-7B-Instruct-userfeedback-4k-iter2 Text Generation • 8B • Updated 3 days ago • 15 • 1