2 20 9

shijie xia

seven-cat

https://shijie-xia.github.io/

AI & ML interests

LLMs

Recent Activity

upvoted a paper 9 days ago

daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently

upvoted a paper 17 days ago

daVinci-Dev: Agent-native Mid-training for Software Engineering

upvoted a paper 24 days ago

AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts

View all activity

Organizations

upvoted a paper 9 days ago

daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently

Paper • 2602.02619 • Published 10 days ago • 50

upvoted a paper 17 days ago

daVinci-Dev: Agent-native Mid-training for Software Engineering

Paper • 2601.18418 • Published 17 days ago • 124

upvoted a paper 24 days ago

AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts

Paper • 2601.11044 • Published 28 days ago • 34

upvoted 2 papers about 1 month ago

One Sample to Rule Them All: Extreme Data Efficiency in RL Scaling

Paper • 2601.03111 • Published Jan 6 • 10

LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation

Paper • 2512.23576 • Published Dec 29, 2025 • 65

upvoted a paper 3 months ago

GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization

Paper • 2511.15705 • Published Nov 19, 2025 • 97

upvoted a paper 4 months ago

RoboOmni: Proactive Robot Manipulation in Omni-modal Context

Paper • 2510.23763 • Published Oct 27, 2025 • 55

upvoted an article 4 months ago

Article

mem-agent: Equipping LLM Agents with Memory Using RL

Oct 9, 2025

•

upvoted a paper 4 months ago

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Paper • 2510.18927 • Published Oct 21, 2025 • 84

upvoted 2 papers 5 months ago

LIMI: Less is More for Agency

Paper • 2509.17567 • Published Sep 22, 2025 • 104

Visual Programmability: A Guide for Code-as-Thought in Chart Understanding

Paper • 2509.09286 • Published Sep 11, 2025 • 11

upvoted 2 papers 8 months ago

OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling

Paper • 2506.20512 • Published Jun 25, 2025 • 47

ALE-Bench: A Benchmark for Long-Horizon Objective-Driven Algorithm Engineering

Paper • 2506.09050 • Published Jun 10, 2025 • 6

upvoted 2 papers 9 months ago

Thinking with Generated Images

Paper • 2505.22525 • Published May 28, 2025 • 15

Efficient Agent Training for Computer Use

Paper • 2505.13909 • Published May 20, 2025 • 44

upvoted 3 papers 10 months ago

Generative AI Act II: Test Time Scaling Drives Cognition Engineering

Paper • 2504.13828 • Published Apr 18, 2025 • 18

MegaMath: Pushing the Limits of Open Math Corpora

Paper • 2504.02807 • Published Apr 3, 2025 • 35

Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme

Paper • 2504.02587 • Published Apr 3, 2025 • 32

upvoted a paper about 1 year ago

PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models

Paper • 2501.03124 • Published Jan 6, 2025 • 14

upvoted a collection over 1 year ago

Long Context

Collection

53 items • Updated Jun 5, 2025 • 8

shijie xia

AI & ML interests

Recent Activity

Organizations

seven-cat's activity

mem-agent: Equipping LLM Agents with Memory Using RL