Zhaolin Gao's picture

2 1 6

Zhaolin Gao

GitBag

·

https://zhaolingao.github.io/

AI & ML interests

Reinforcement Learning from Human Feedback

Recent Activity

updated a model 5 days ago

GitBag/reasoning_rebel_meta_general_1024_1024_eta_1e4_lr_3e-7_1734672146

updated a model 5 days ago

GitBag/reasoning_rebel_meta_general_1024_1024_eta_1e6_lr_3e-7_1734682709

updated a model 5 days ago

GitBag/reasoning_rebel_meta_general_1024_1024_eta_1e5_lr_3e-7_1734677447

View all activity

Articles

RLHF 101: A Technical Dive into RLHF

Organizations

GitBag's activity

liked 3 models 3 months ago

Cornell-AGI/REBEL-Llama-3-Armo-iter_1

Updated Sep 2 • 12 • 1

Cornell-AGI/REBEL-Llama-3-Armo-iter_2

Updated Sep 2 • 13 • 2

Cornell-AGI/REBEL-Llama-3-Armo-iter_3

Updated Sep 2 • 9 • 2

liked a model 6 months ago

Cornell-AGI/REBEL-Llama-3-epoch_2

Text Generation • Updated Sep 1 • 10 • 3

liked 2 models 7 months ago

Cornell-AGI/REBEL-OpenChat-3.5

Text Generation • Updated Sep 1 • 12 • 1

Cornell-AGI/REBEL-Llama-3

Text Generation • Updated Sep 1 • 20 • 1