Suzie Oh's picture

Suzie Oh

ohsuz

·

ohsuz

AI & ML interests

None yet

Recent Activity

liked a model 4 days ago

numind/NuMarkdown-8B-Thinking

upvoted a collection 8 days ago

liked a model 10 days ago

Nanbeige/CoSineVerifier-Tool-4B

View all activity

Organizations

upvoted a collection 8 days ago

TranslateGemma

3 items • Updated 15 days ago • 199

upvoted a paper 15 days ago

Scaling Generalist Data-Analytic Agents

Paper • 2509.25084 • Published Sep 29, 2025 • 20

upvoted 2 papers 18 days ago

DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle

Paper • 2512.04324 • Published Dec 3, 2025 • 154

Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction

Paper • 2512.04987 • Published Dec 4, 2025 • 80

upvoted an article 18 days ago

Article

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

May 7, 2024

•

115

upvoted 4 articles 25 days ago

Article

M2.1: Multilingual and Multi-Task Coding with Strong Generalization

25 days ago

•

37

Article

Why Did MiniMax M2 End Up as a Full Attention Model?

Oct 30, 2025

•

75

Article

Aligning to What? Rethinking Agent Generalization in MiniMax M2

Oct 30, 2025

•

42

Article

What makes good reasoning data

Oct 30, 2025

•

44

upvoted a collection 29 days ago

Open Korean LLM (MSIT 2025)

6 items • Updated 28 days ago • 15

upvoted a collection about 2 months ago

ToolRM

ToolRM: Towards Agentic Tool-Use Reward Modeling • 6 items • Updated 16 days ago • 4

upvoted an article 2 months ago

Article

How to Build an MCP Server with Gradio

Apr 30, 2025

•

202

upvoted a paper 2 months ago

PIPer: On-Device Environment Setup via Online Reinforcement Learning

Paper • 2509.25455 • Published Sep 29, 2025 • 38

upvoted a collection 2 months ago

🦫 PIPer

All the resources for our paper "PIPer: On-Device Environment Setup via Online Reinforcement Learning"! • 9 items • Updated Oct 1, 2025 • 3

upvoted an article 2 months ago

Article

Jupyter Agents: training LLMs to reason with notebooks

+1

Sep 10, 2025

•

61

upvoted 5 papers 2 months ago

FunReason-MT Technical Report: Overcoming the Complexity Barrier in Multi-Turn Function Calling

Paper • 2510.24645 • Published Oct 28, 2025 • 10

Spurious Rewards: Rethinking Training Signals in RLVR

Paper • 2506.10947 • Published Jun 12, 2025 • 2

Limits of Generalization in RLVR: Two Case Studies in Mathematical Reasoning

Paper • 2510.27044 • Published Oct 30, 2025 • 6

Data-Efficient RLVR via Off-Policy Influence Guidance

Paper • 2510.26491 • Published Oct 30, 2025 • 11

The Path Not Taken: RLVR Provably Learns Off the Principals

Paper • 2511.08567 • Published Nov 11, 2025 • 34