1 62 1

Rui Sun PRO

ThreeSR

https://threesr.github.io/

AI & ML interests

Vision and Language Multimodal Learning, CV, NLP, LLM

Recent Activity

updated a collection about 10 hours ago

New Papers

updated a collection about 10 hours ago

New Papers

upvoted a paper about 10 hours ago

Co-Director: Agentic Generative Video Storytelling

View all activity

Organizations

upvoted 2 papers about 10 hours ago

Co-Director: Agentic Generative Video Storytelling

Paper • 2604.24842 • Published 3 days ago • 13

AutoResearchBench: Benchmarking AI Agents on Complex Scientific Literature Discovery

Paper • 2604.25256 • Published 2 days ago • 26

upvoted a paper 8 days ago

Mind DeepResearch Technical Report

Paper • 2604.14518 • Published 13 days ago • 23

upvoted a paper 12 days ago

DR^{3}-Eval: Towards Realistic and Reproducible Deep Research Evaluation

Paper • 2604.14683 • Published 14 days ago • 35

upvoted a paper 17 days ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published 21 days ago • 242

upvoted 7 papers 18 days ago

Externalization in LLM Agents: A Unified Review of Memory, Skills, Protocols and Harness Engineering

Paper • 2604.08224 • Published 21 days ago • 51

KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation

Paper • 2604.08455 • Published 21 days ago • 47

DMax: Aggressive Parallel Decoding for dLLMs

Paper • 2604.08302 • Published 21 days ago • 51

upvoted 2 papers 20 days ago

OpenVLThinkerV2: A Generalist Multimodal Reasoning Model for Multi-domain Visual Tasks

Paper • 2604.08539 • Published 21 days ago • 49

Think in Strokes, Not Pixels: Process-Driven Image Generation via Interleaved Reasoning

Paper • 2604.04746 • Published 22 days ago • 71

upvoted a paper 29 days ago

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 349

upvoted 2 papers about 1 month ago

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

Paper • 2603.24440 • Published Mar 25 • 98

MoKus: Leveraging Cross-Modal Knowledge Transfer for Knowledge-Aware Concept Customization

Paper • 2603.12743 • Published Mar 13 • 3

upvoted a paper 2 months ago

UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding

Paper • 2307.00862 • Published Jul 3, 2023 • 1

upvoted 2 papers 3 months ago

AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security

Paper • 2601.18491 • Published Jan 26 • 125

Aligning Agentic World Models via Knowledgeable Experience Learning

Paper • 2601.13247 • Published Jan 19 • 15

Rui Sun PRO

AI & ML interests

Recent Activity

Organizations

ThreeSR's activity