Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
6
Zhihe Yang
zhyang2226
Follow
21world's profile picture
1 follower
·
2 following
AI & ML interests
Trustworthy RL & Offline RL
Recent Activity
liked
a model
10 days ago
tencent/HunyuanVideo
authored
a paper
17 days ago
Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key
authored
a paper
17 days ago
Do Not Let Low-Probability Tokens Over-Dominate in RL for LLMs
View all activity
Organizations
zhyang2226
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a model
10 days ago
tencent/HunyuanVideo
Text-to-Video
•
Updated
Mar 6
•
1.66k
•
•
1.98k
liked
a dataset
3 months ago
BytedTsinghua-SIA/DAPO-Math-17k
Viewer
•
Updated
Apr 18
•
1.79M
•
5.41k
•
83
liked
a Space
5 months ago
Running
441
441
AI Deadlines
⚡
Organize project deadlines with AI assistance
liked
a dataset
5 months ago
openbmb/RLAIF-V-Dataset
Viewer
•
Updated
Mar 4
•
74.8k
•
1.92k
•
177
liked
a model
6 months ago
lmms-lab/llava-onevision-qwen2-7b-ov
Text Generation
•
8B
•
Updated
Sep 2, 2024
•
78.4k
•
53
liked
a model
12 months ago
openbmb/RLHF-V
Text Generation
•
Updated
May 28, 2024
•
26
•
17