sophia peng
sophiapeng
ยท
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 1 month ago
Harnessing Negative Signals: Reinforcement Distillation from Teacher
Data for LLM Reasoning
updated
a model
3 months ago
infly/INFLogic-Qwen2.5-32B-RL-Preview
liked
a model
3 months ago
infly/INFLogic-Qwen2.5-32B-RL-Preview