-
AmberYifan/llama2-7b-sft-ultrachat-safeRLHF
Text Generation • 7B • Updated • 18 -
AmberYifan/mistral-v0.1-7b-sft-ultrachat-safeRLHF
Text Generation • 7B • Updated • 17 -
AmberYifan/Mistral-7B-v0.3-sft-ultrachat-safeRLHF
Text Generation • 7B • Updated • 21 -
AmberYifan/Gemma-2-9B-sft-ultrachat-safeRLHF
Text Generation • 9B • Updated • 18
Yifan Wang
AmberYifan
AI & ML interests
None yet
Recent Activity
updated
a model
1 day ago
AmberYifan/Qwen2.5-7B-Open-R1-Code-GRPO
updated
a model
1 day ago
AmberYifan/llama3-8b-full-pretrain-control-tweet-1m-en-sft
published
a model
1 day ago
AmberYifan/Qwen2.5-7B-Open-R1-Code-GRPO