Yinxu Pan

cppowboy

https://github.com/Cppowboy

AI & ML interests

RL for LLM, Code&Math Reasoning, Function Calling, Code Interpreter, Vision-Language Pretraining

Recent Activity

liked a dataset about 13 hours ago

nvidia/OpenCodeInstruct

upvoted a paper 15 days ago

ClawEnvKit: Automatic Environment Generation for Claw-Like Agents

liked a model 17 days ago

moonshotai/Kimi-K2.6

View all activity

Organizations

liked a dataset about 13 hours ago

nvidia/OpenCodeInstruct

Viewer • Updated Apr 28, 2025 • 4.97M • 7.8k • 83

upvoted a paper 15 days ago

ClawEnvKit: Automatic Environment Generation for Claw-Like Agents

Paper • 2604.18543 • Published 18 days ago • 27

liked a model 17 days ago

moonshotai/Kimi-K2.6

Image-Text-to-Text • 1.1T • Updated 8 days ago • 1.07M • • 1.22k

upvoted a paper 17 days ago

Qwen3.5-Omni Technical Report

Paper • 2604.15804 • Published 21 days ago • 58

liked a dataset 22 days ago

lambda/hermes-agent-reasoning-traces

Viewer • Updated 20 days ago • 14.7k • 9k • 286

upvoted 3 papers 22 days ago

Lightning OPD: Efficient Post-Training for Large Reasoning Models with Offline On-Policy Distillation

Paper • 2604.13010 • Published 24 days ago • 13

Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2604.12374 • Published 24 days ago • 36

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published 24 days ago • 90

upvoted 2 papers 26 days ago

SkillClaw: Let Skills Evolve Collectively with Agentic Evolver

Paper • 2604.08377 • Published 29 days ago • 289

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Paper • 2604.08523 • Published 29 days ago • 261

liked a model 30 days ago

openbmb/VoxCPM2

Text-to-Speech • Updated 22 days ago • 165k • 1.29k

liked a dataset about 1 month ago

zai-org/CC-Bench-trajectories

Viewer • Updated Sep 30, 2025 • 260 • 770 • 94

upvoted 3 papers about 1 month ago

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 350

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published Mar 29 • 145

SlopCodeBench: Benchmarking How Coding Agents Degrade Over Long-Horizon Iterative Tasks

Paper • 2603.24755 • Published Mar 25 • 30

liked 2 datasets about 1 month ago

mercor/APEX-SWE

Updated 15 days ago • 11.1k • 24

mercor/apex-agents

Benchmark • Updated Mar 3 • 480 • 34k • 121

upvoted a paper about 1 month ago

LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning

Paper • 2603.21065 • Published Mar 22 • 77

New activity in Qwen/Qwen3.5-397B-A17B about 2 months ago

Can not reproduce evaluation results on SWE-Verified

#63 opened about 2 months ago by

cppowboy

upvoted a paper about 2 months ago

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published Mar 17 • 139

Yinxu Pan

AI & ML interests

Recent Activity

Organizations

cppowboy's activity

Can not reproduce evaluation results on SWE-Verified