Interplay-LM-Reasoning

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

yuexiang96

authored 4 papers 9 days ago

Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents

Paper • 2510.24702 • Published Oct 28 • 27

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Paper • 2510.25726 • Published Oct 29 • 45

Simulating Environments with Reasoning Models for Agent Training

Paper • 2511.01824 • Published Nov 3 • 2

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

Paper • 2512.07783 • Published 17 days ago • 35

Clockz

in Interplay-LM-Reasoning/extrapolation_midtrain 12 days ago

Add pipeline tag, GitHub link, and improved model description

#1 opened 12 days ago by nielsr

Clockz

in Interplay-LM-Reasoning/extrapolation_rl 12 days ago

Improve model card: Add pipeline tag and GitHub link

#1 opened 12 days ago by nielsr

Clockz

updated 2 models 15 days ago

Interplay-LM-Reasoning/extrapolation_rl

Text Generation • Updated 12 days ago

Interplay-LM-Reasoning/extrapolation_midtrain

Text Generation • Updated 12 days ago

Clockz

updated a dataset 15 days ago

Interplay-LM-Reasoning/context

Updated 15 days ago • 9

Clockz

published 2 datasets 15 days ago

Interplay-LM-Reasoning/context

Updated 15 days ago • 9

Interplay-LM-Reasoning/extrapolation

Updated 15 days ago • 6

Clockz

published 3 models 15 days ago

Clockz

authored a paper 16 days ago

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

Paper • 2512.07783 • Published 17 days ago • 35

yuexiang96

authored 5 papers 6 months ago

Small Models Struggle to Learn from Strong Reasoners

Paper • 2502.12143 • Published Feb 17 • 39

Evaluating Vision-Language Models as Evaluators in Path Planning

Paper • 2411.18711 • Published Nov 27, 2024

VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search

Paper • 2503.10582 • Published Mar 13 • 24

Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators

Paper • 2503.19877 • Published Mar 25 • 1

VisualPuzzles: Decoupling Multimodal Reasoning Evaluation from Domain Knowledge

Paper • 2504.10342 • Published Apr 14 • 10

Team members 2
