Graham Neubig

gneubig

http://www.phontron.com

AI & ML interests

NLP

Recent Activity

upvoted a paper about 2 months ago

Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning

Paper • 2507.00432 • Published Jul 1 • 74

authored a paper 3 months ago

The CoT Encyclopedia: Analyzing, Predicting, and Controlling how a Reasoning Model will Think

Paper • 2505.10185 • Published May 15 • 26

upvoted a paper 6 months ago

The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks

Paper • 2502.08235 • Published Feb 12 • 59

upvoted a paper 7 months ago

Demystifying Long Chain-of-Thought Reasoning in LLMs

Paper • 2502.03373 • Published Feb 5 • 59

authored a paper 7 months ago

Demystifying Long Chain-of-Thought Reasoning in LLMs

Paper • 2502.03373 • Published Feb 5 • 59

updated a dataset 8 months ago

gneubig/aime-1983-2024

Viewer • Updated Dec 21, 2024 • 933 • 3.11k • 12

authored a paper 8 months ago

TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks

Paper • 2412.14161 • Published Dec 18, 2024 • 52

upvoted a paper 8 months ago

TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks

Paper • 2412.14161 • Published Dec 18, 2024 • 52

updated a dataset 9 months ago

all-hands/openhands-feedback

Viewer • Updated Dec 14, 2024 • 275 • 12 • 4

upvoted a paper 9 months ago

The BrowserGym Ecosystem for Web Agent Research

Paper • 2412.05467 • Published Dec 6, 2024 • 22

authored a paper 9 months ago

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

Paper • 2412.05237 • Published Dec 6, 2024 • 48

upvoted a paper 9 months ago

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

Paper • 2412.05237 • Published Dec 6, 2024 • 48

authored a paper 9 months ago

Evaluating Language Models as Synthetic Data Generators

Paper • 2412.03679 • Published Dec 4, 2024 • 49

liked a dataset 9 months ago

all-hands/openhands-feedback

Viewer • Updated Dec 14, 2024 • 275 • 12 • 4

upvoted a paper 9 months ago

Evaluating Language Models as Synthetic Data Generators

Paper • 2412.03679 • Published Dec 4, 2024 • 49

authored a paper 9 months ago

OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs

Paper • 2411.14199 • Published Nov 21, 2024 • 32

authored a paper 10 months ago

JMMMU: A Japanese Massive Multi-discipline Multimodal Understanding Benchmark for Culture-aware Evaluation

Paper • 2410.17250 • Published Oct 22, 2024 • 15

upvoted a paper 10 months ago

Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages

Paper • 2410.16153 • Published Oct 21, 2024 • 45

authored 2 papers 10 months ago

Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages

Paper • 2410.16153 • Published Oct 21, 2024 • 45

NaturalBench: Evaluating Vision-Language Models on Natural Adversarial Samples

Paper • 2410.14669 • Published Oct 18, 2024 • 40
