Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
ScienceOne-Math
classroom
Activity Feed
Follow
5
AI & ML interests
None defined yet.
Recent Activity
SONGJUNTU
authored
a paper
1 day ago
In-Dataset Trajectory Return Regularization for Offline Preference-based Reinforcement Learning
SONGJUNTU
authored
a paper
1 day ago
Enhancing LLM Reasoning with Iterative DPO: A Comprehensive Empirical Investigation
SONGJUNTU
authored
a paper
1 day ago
SRFT: A Single-Stage Method with Supervised and Reinforcement Fine-Tuning for Reasoning
View all activity
Team members
5
S1-Math
's models
None public yet