The official datasets and model checkpoints of ARPO
KABI
dongguanting
AI & ML interests
Reasoning and Alignment for Large Language Models
Recent Activity
liked a model about 6 hours ago
dongguanting/RAG-Critic-3B
upvoted a paper about 7 hours ago
VeriGUI: Verifiable Long-Chain GUI Dataset
upvoted a paper about 7 hours ago
Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens