arxiv:2407.12852
Tony Montes
t-montes
ยท
AI & ML interests
NLP, GenAI
Recent Activity
upvoted
a
paper
3 days ago
Offline Reinforcement Learning for LLM Multi-Step Reasoning
liked
a model
8 days ago
impira/layoutlm-document-qa
upvoted
a
paper
15 days ago
RL Zero: Zero-Shot Language to Behaviors without any Supervision
Organizations
Papers
2
spaces
5
models
None public yet
datasets
None public yet