Shaobai Jiang
shaobaij
ยท
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 5 hours ago
RLEP: Reinforcement Learning with Experience Replay for LLM Reasoning
upvoted
a
paper
about 5 hours ago
FlexOlmo: Open Language Models for Flexible Data Use
upvoted
a
paper
about 15 hours ago
RL-PLUS: Countering Capability Boundary Collapse of LLMs in
Reinforcement Learning with Hybrid-policy Optimization
Organizations
None yet