Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
5
13
8
Jie Liu
jieliu
Follow
DecoderWQH666's profile picture
sefira32's profile picture
xiao-lin's profile picture
16 followers
·
17 following
yifan123
AI & ML interests
Reinforcement Learning, Large Language Model
Recent Activity
authored
a paper
about 10 hours ago
Flow-GRPO: Training Flow Matching Models via Online RL
upvoted
a
paper
about 16 hours ago
Flow-GRPO: Training Flow Matching Models via Online RL
commented
on
a paper
about 18 hours ago
Flow-GRPO: Training Flow Matching Models via Online RL
View all activity
Organizations
Papers
7
arxiv:
2505.05470
arxiv:
2501.13918
arxiv:
2407.16154
arxiv:
2406.11817
Expand 7 papers
models
7
Sort: Recently updated
jieliu/Qwen2-7B-Instruct-DPO-score-diff-2-chat-noval-beta0.5-bs24
Updated
Sep 7, 2024
jieliu/Qwen2-7B-Instruct-DPO-score-diff-2-chat-math-noval-beta0.5-bs24
Updated
Sep 7, 2024
jieliu/Qwen2-7B-Instruct-DPO-score-diff-2-longqa-beta0.5-bs24-seq2048
Updated
Sep 5, 2024
jieliu/Qwen2-7B-Instruct-DPO-score-diff-2-longqa-beta0.5-bs24
Updated
Sep 5, 2024
jieliu/Qwen2-7B-Instruct-DPO-score-diff-2-longqa-beta0.5
Updated
Sep 3, 2024
jieliu/Qwen2-7B-Instruct-DPO-score-diff-2-beta0.5
Updated
Jul 30, 2024
jieliu/Storm-7B
Text Generation
•
Updated
Jun 18, 2024
•
12
•
41
datasets
1
jieliu/homepage
Viewer
•
Updated
Feb 10
•
4
•
237