James Johnson's picture

James Johnson

wyattbaker

AI & ML interests

Research on LLM agents and evaluation. Mostly focused on experiments.

Recent Activity

liked a dataset 3 days ago

gaia-benchmark/GAIA

upvoted a paper 7 days ago

Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers

liked a model 7 days ago

devifr/my-model

View all activity

Organizations

None yet

upvoted a paper 7 days ago

Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers

Paper • 2605.06169 • Published 14 days ago • 186

upvoted a paper 9 days ago

SEIF: Self-Evolving Reinforcement Learning for Instruction Following

Paper • 2605.07465 • Published 13 days ago • 29

upvoted 2 papers 14 days ago

Chain of Evidence: Pixel-Level Visual Attribution for Iterative Retrieval-Augmented Generation

Paper • 2605.01284 • Published 19 days ago • 3

Leveraging Verifier-Based Reinforcement Learning in Image Editing

Paper • 2604.27505 • Published 21 days ago • 57

upvoted a paper 20 days ago

Heterogeneous Scientific Foundation Model Collaboration

Paper • 2604.27351 • Published 21 days ago • 217

upvoted a paper 28 days ago

DiPO: Disentangled Perplexity Policy Optimization for Fine-grained Exploration-Exploitation Trade-Off

Paper • 2604.13902 • Published Apr 15 • 62

upvoted 4 papers about 1 month ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published Apr 9 • 245

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 503

Phantom: Physics-Infused Video Generation via Joint Modeling of Visual and Latent Physical Dynamics

Paper • 2604.08503 • Published Apr 9 • 7

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 629

upvoted a paper about 2 months ago

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published Mar 30 • 342

upvoted 3 papers 2 months ago

Efficient Reasoning with Balanced Thinking

Paper • 2603.12372 • Published Mar 12 • 149

Demystifing Video Reasoning

Paper • 2603.16870 • Published Mar 17 • 371

SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models

Paper • 2603.16859 • Published Mar 17 • 248