MAmmoTH-VL

community

AI & ML interests

None defined yet.

Recent Activity

yuexiang96 authored a paper 4 days ago

Small Models Struggle to Learn from Strong Reasoners

yuexiang96 authored a paper 4 days ago

Evaluating Vision-Language Models as Evaluators in Path Planning

yuexiang96 authored a paper 4 days ago

VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search

yuexiang96

authored 10 papers 4 days ago

Speculative Thinking: Enhancing Small-Model Reasoning with Large Model Guidance at Inference Time

Paper • 2504.12329 • Published Apr 12

Overtrained Language Models Are Harder to Fine-Tune

Paper • 2503.19206 • Published Mar 24 • 2

The CoT Encyclopedia: Analyzing, Predicting, and Controlling how a Reasoning Model will Think

Paper • 2505.10185 • Published May 15 • 25

VisCoder: Fine-Tuning LLMs for Executable Python Visualization Code Generation

Paper • 2506.03930 • Published Jun 4 • 24

Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning

Paper • 2507.00432 • Published 6 days ago • 54

aaabiao

authored a paper 5 days ago

Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning

Paper • 2507.00432 • Published 6 days ago • 54

luodian

authored 2 papers 9 days ago

EgoLife: Towards Egocentric Life Assistant

Paper • 2503.03803 • Published Mar 5 • 45

MMSearch-R1: Incentivizing LMMs to Search

Paper • 2506.20670 • Published 11 days ago • 57

wenhu

authored 2 papers about 1 month ago

VisCoder: Fine-Tuning LLMs for Executable Python Visualization Code Generation

Paper • 2506.03930 • Published Jun 4 • 24

Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem

Paper • 2506.03295 • Published Jun 3 • 17

ubowang

authored a paper about 1 month ago

Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem

Paper • 2506.03295 • Published Jun 3 • 17

wenhu

authored a paper about 1 month ago

StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs

Paper • 2505.20139 • Published May 26 • 18

aaabiao

authored a paper 3 months ago

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

Paper • 2504.05535 • Published Apr 7 • 44

ubowang

authored 2 papers 3 months ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20 • 105

ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations

Paper • 2504.00824 • Published Apr 1 • 43

Team members 8
