Xinyu Fang

nebulae09

·

FangXinyu-0913

AI & ML interests

None yet

Recent Activity

upvoted a collection 1 day ago

liked a dataset 1 day ago

internlm/RNGBench-Game-Trajectories

upvoted a paper 1 day ago

Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games

View all activity

Organizations

upvoted a collection 1 day ago

RNGBench

Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games • 2 items • Updated 16 days ago • 3

upvoted a paper 1 day ago

Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games

Paper • 2606.19338 • Published 22 days ago • 50

upvoted a paper 29 days ago

N-GRPO: Embedding-Level Neighbor Mixing for Enhanced Policy Optimization

Paper • 2606.10768 • Published 30 days ago • 24

upvoted a paper about 2 months ago

Training Long-Context Vision-Language Models Effectively with Generalization Beyond 128K Context

Paper • 2605.13831 • Published May 13 • 89

upvoted 5 papers 3 months ago

OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation

Paper • 2604.18486 • Published Apr 20 • 96

OpenGame: Open Agentic Coding for Games

Paper • 2604.18394 • Published Apr 20 • 86

Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

Paper • 2604.18292 • Published Apr 20 • 88

Kernel-Smith: A Unified Recipe for Evolutionary Kernel Optimization

Paper • 2603.28342 • Published Mar 30 • 26

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Paper • 2603.25040 • Published Mar 26 • 134

upvoted 2 papers 4 months ago

OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published Mar 10 • 158

Visual-ERM: Reward Modeling for Visual Equivalence

Paper • 2603.13224 • Published Mar 13 • 21

upvoted 3 papers 5 months ago

DataChef: Cooking Up Optimal Data Recipes for LLM Adaptation via Reinforcement Learning

Paper • 2602.11089 • Published Feb 11 • 18

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published Feb 2 • 275

TL-GRPO: Turn-Level RL for Reasoning-Guided Iterative Optimization

Paper • 2601.16480 • Published Jan 23 • 50

upvoted a paper 6 months ago

Mindscape-Aware Retrieval Augmented Generation for Improved Long Context Understanding

Paper • 2512.17220 • Published Dec 19, 2025 • 115

upvoted 5 papers 7 months ago

Long-horizon Reasoning Agent for Olympiad-Level Mathematical Problem Solving

Paper • 2512.10739 • Published Dec 11, 2025 • 47

OPV: Outcome-based Process Verifier for Efficient Long Chain-of-Thought Verification

Paper • 2512.10756 • Published Dec 11, 2025 • 35

IWR-Bench: Can LVLMs reconstruct interactive webpage from a user interaction video?

Paper • 2509.24709 • Published Sep 29, 2025 • 7

ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning

Paper • 2511.14366 • Published Nov 18, 2025 • 17

ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning

Paper • 2512.05111 • Published Dec 4, 2025 • 50