Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Tianlin Liu
tianlinliu0121
Follow
gqjia's profile picture
1 follower
·
1 following
https://tianlinliu.com/
liutianlin0121
AI & ML interests
None yet
Articles
The N Implementation Details of RLHF with PPO
Oct 24, 2023
•
18
Organizations
Papers
2
arxiv:
2402.04792
arxiv:
2402.02992
models
4
Sort: Recently updated
tianlinliu0121/zephyr-7b-dpo-full-debug-regression
Text Generation
•
Updated
Dec 7, 2023
•
13
tianlinliu0121/zephyr-7b-dpo-full-beta-0.2
Text Generation
•
Updated
Nov 23, 2023
•
10
tianlinliu0121/zephyr-7b-dpo-full-beta-0.083
Text Generation
•
Updated
Nov 19, 2023
•
3
tianlinliu0121/zephyr-7b-dpo-full
Text Generation
•
Updated
Nov 18, 2023
•
7
datasets
None public yet