- AgentGym: Evolving Large Language Model-based Agents across Diverse Environments
  Paper • 2406.04151 • Published • 24
- DeepAnalyze: Agentic Large Language Models for Autonomous Data Science
  Paper • 2510.16872 • Published • 106
- Scaling Generalist Data-Analytic Agents
  Paper • 2509.25084 • Published • 18
- Scaling Agents via Continual Pre-training
  Paper • 2509.13310 • Published • 117

Collections including paper arxiv:2509.25084
- SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning
  Paper • 2504.08600 • Published • 32
- Think-on-Graph 3.0: Efficient and Adaptive LLM Reasoning on Heterogeneous Graphs via Multi-Agent Dual-Evolving Context Retrieval
  Paper • 2509.21710 • Published • 18
- TTRL: Test-Time Reinforcement Learning
  Paper • 2504.16084 • Published • 120
- Agent Lightning: Train ANY AI Agents with Reinforcement Learning
  Paper • 2508.03680 • Published • 122

- Let LLMs Break Free from Overthinking via Self-Braking Tuning
  Paper • 2505.14604 • Published • 23
- AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios
  Paper • 2505.16944 • Published • 8
- Training Step-Level Reasoning Verifiers with Formal Verification Tools
  Paper • 2505.15960 • Published • 7
- The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning
  Paper • 2505.15134 • Published • 6

- Scaling Generalist Data-Analytic Agents
  Paper • 2509.25084 • Published • 18
- PARROT: A Benchmark for Evaluating LLMs in Cross-System SQL Translation
  Paper • 2509.23338 • Published • 4
- LLMSQL: Upgrading WikiSQL for the LLM Era of Text-to-SQL
  Paper • 2510.02350 • Published • 3
- RAG-Anything: All-in-One RAG Framework
  Paper • 2510.12323 • Published • 54

- zjunlp/DataMind-Analysis-Qwen2.5-7B
  Text Generation • 8B • Updated • 10 • 2
- zjunlp/DataMind-Analysis-Qwen2.5-14B
  Text Generation • 15B • Updated • 4 • 2
- Why Do Open-Source LLMs Struggle with Data Analysis? A Systematic Empirical Study
  Paper • 2506.19794 • Published • 8
- zjunlp/DataMind-Analysis-SFT-Data
  Viewer • Updated • 2.82k • 34 • 1