Shaobai Jiang
shaobaij
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 15 hours ago
Beyond Correctness: Harmonizing Process and Outcome Rewards through RL
Training
upvoted
a
paper
about 16 hours ago
Distilled Pretraining: A modern lens of Data, In-Context Learning and
Test-Time Scaling
upvoted
a
paper
about 16 hours ago
Bootstrapping Task Spaces for Self-Improvement
Organizations
None yet