Hugging Face
SONGJUN TU
SONGJUNTU
0 followers · 1 following
AI & ML interests: None yet
Recent Activity
Authored a paper (1 day ago): In-Dataset Trajectory Return Regularization for Offline Preference-based Reinforcement Learning
Authored a paper (1 day ago): Enhancing LLM Reasoning with Iterative DPO: A Comprehensive Empirical Investigation
Authored a paper (1 day ago): SRFT: A Single-Stage Method with Supervised and Reinforcement Fine-Tuning for Reasoning
Organizations
SONGJUNTU's activity
Liked a model (1 day ago): Yuqian-Fu/SRFT
Text Generation • Updated about 5 hours ago • 2