MathCritique

community

https://mathcritique.github.io/

AI & ML interests

LLM Reasoning, Critique

Recent Activity

WooooDyy authored a paper 28 days ago

Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction

WooooDyy authored a paper 11 months ago

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

WooooDyy authored a paper 12 months ago

ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use

models 0

None public yet

datasets 1

MathCritique/MathCritique-76k

Updated Nov 25, 2024 • 11 • 9