1 21 89

Maojia Song

OrangeEye

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Attention Residuals

upvoted a paper about 1 month ago

OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data

liked a model about 1 month ago

Gustrd/SCUT-FBP5500-PyTorch-Model

View all activity

Organizations

upvoted 2 papers about 1 month ago

Attention Residuals

Paper • 2603.15031 • Published Mar 16 • 180

OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data

Paper • 2603.15594 • Published Mar 16 • 149

upvoted 2 papers about 2 months ago

From Perception to Action: An Interactive Benchmark for Vision Reasoning

Paper • 2602.21015 • Published Feb 24 • 23

AutoWebWorld: Synthesizing Infinite Verifiable Web Environments via Finite State Machines

Paper • 2602.14296 • Published Feb 15 • 51

upvoted 4 papers 4 months ago

upvoted a paper 5 months ago

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Paper • 2511.16043 • Published Nov 20, 2025 • 111

upvoted 2 papers 6 months ago

LLMs Can't Handle Peer Pressure: Crumbling under Multi-Agent Social Interactions

Paper • 2508.18321 • Published Aug 24, 2025 • 2

Demystifying deep search: a holistic evaluation with hint-free multi-hop questions and factorised metrics

Paper • 2510.05137 • Published Oct 1, 2025 • 6

upvoted a paper 7 months ago

Scaling Agents via Continual Pre-training

Paper • 2509.13310 • Published Sep 16, 2025 • 117

upvoted a paper 8 months ago

WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent

Paper • 2508.05748 • Published Aug 7, 2025 • 142

upvoted an article 10 months ago

Article

🦸🏻#1: Open-endedness and AI Agents – A Path from Generative to Creative AI?

Dec 25, 2024

•

upvoted a paper 12 months ago

Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling

Paper • 2504.13169 • Published Apr 17, 2025 • 39

upvoted an article about 1 year ago

Article

The N Implementation Details of RLHF with PPO

Oct 24, 2023

•

upvoted 3 collections about 1 year ago

Long Reasoning

Collection

Datasets with reasoning traces for math and code (Train + Eval) • 49 items • Updated Mar 21, 2025 • 1

Reasoning Datasets

Collection

Distilled synthetic Reasoning datasets • 7 items • Updated Feb 2, 2025 • 61

🧠 Reasoning datasets

Collection

Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19, 2025 • 188

upvoted a paper over 1 year ago

M-Longdoc: A Benchmark For Multimodal Super-Long Document Understanding And A Retrieval-Aware Tuning Framework

Paper • 2411.06176 • Published Nov 9, 2024 • 45

Maojia Song

AI & ML interests

Recent Activity

Organizations

OrangeEye's activity

🦸🏻#1: Open-endedness and AI Agents – A Path from Generative to Creative AI?

The N Implementation Details of RLHF with PPO