Andrew Ceniccola
andrewsamce
ยท
AI & ML interests
Deep Reinforcement Learning of Concepts (and the intersection of RL and NLP).
Recent Activity
upvoted
a
paper
about 1 month ago
TTRL: Test-Time Reinforcement Learning
upvoted
a
paper
about 1 month ago
Learning to Reason under Off-Policy Guidance
upvoted
a
paper
about 1 month ago
Does Reinforcement Learning Really Incentivize Reasoning Capacity in
LLMs Beyond the Base Model?
Organizations
models
0
None public yet
datasets
0
None public yet