Théo Pomies PRO
theopomies
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 9 hours ago
On the Generalization of SFT: A Reinforcement Learning Perspective with
Reward Rectification
upvoted
a
paper
about 9 hours ago
R-Zero: Self-Evolving Reasoning LLM from Zero Data
upvoted
a
paper
1 day ago
Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens
Organizations
None yet