Jarrod Barnes PRO

Jarrodbarnes

jbarnes850

AI & ML interests

None yet

Recent Activity

liked a model about 1 hour ago

Qwen/Qwen3-30B-A3B-Thinking-2507

liked a model 9 days ago

Arc-Intelligence/arc-teacher-8b

liked a dataset 20 days ago

Salesforce/CRMArenaPro

Organizations

upvoted a paper 20 days ago

SSRL: Self-Search Reinforcement Learning

Paper • 2508.10874 • Published 24 days ago • 91

upvoted a paper 26 days ago

CoT-Self-Instruct: Building high-quality synthetic prompts for reasoning and non-reasoning tasks

Paper • 2507.23751 • Published Jul 31 • 4

upvoted 2 papers about 1 month ago

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4 • 241

τ^2-Bench: Evaluating Conversational Agents in a Dual-Control Environment

Paper • 2506.07982 • Published Jun 9 • 6

upvoted an article about 1 month ago

Article

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

and 4 others • Jul 29 • 169

upvoted 2 papers about 1 month ago

SRPO: A Cross-Domain Implementation of Large-Scale Reinforcement Learning on LLM

Paper • 2504.14286 • Published Apr 19 • 2

Can One Domain Help Others? A Data-Centric Study on Multi-Domain Reasoning via Reinforcement Learning

Paper • 2507.17512 • Published Jul 23 • 36

upvoted an article about 2 months ago

Article

OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models

and 3 others • Jul 18 • 47

upvoted 3 papers about 2 months ago

SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning

Paper • 2504.08600 • Published Apr 11 • 31

Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination

Paper • 2507.10532 • Published Jul 14 • 88

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 287

upvoted an article 2 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

and 22 others • Jul 8 • 647

upvoted a paper 2 months ago

Ovis-U1 Technical Report

Paper • 2506.23044 • Published Jun 29 • 62

upvoted a collection 2 months ago

VisionLM

Collection

1448 items • Updated 2 days ago • 108

upvoted 2 papers 2 months ago

Listener-Rewarded Thinking in VLMs for Image Preferences

Paper • 2506.22832 • Published Jun 28 • 23

MARBLE: A Hard Benchmark for Multimodal Spatial Reasoning and Planning

Paper • 2506.22992 • Published Jun 28 • 12

upvoted a collection 2 months ago

QVQ

Collection

QVQ: Qwen models for visual reasoning • 7 items • Updated Jul 21 • 52

upvoted a paper 2 months ago

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Paper • 2505.24726 • Published May 30 • 271

upvoted an article 4 months ago

Article

Let's talk about LLM evaluation

May 23, 2024 • 186

upvoted a paper 4 months ago

TRAIL: Trace Reasoning and Agentic Issue Localization

Paper • 2505.08638 • Published May 13 • 6
