IXCLab@Shanghai AI Lab

community

https://github.com/OpenIXCLab

OpenIXCLab

AI & ML interests

None defined yet.

Recent Activity

yuhangzang updated a dataset 24 days ago

OpenIXCLab/mmlongbench-doc-results

yuhangzang updated a Space 24 days ago

OpenIXCLab/mmlongbench-doc

yuhangzang authored a paper about 2 months ago

ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning

View all activity

Papers

$\text{G}^2$RPO: Granular GRPO for Precise Reward in Flow Models

View all Papers

yuhangzang

updated a dataset 24 days ago

OpenIXCLab/mmlongbench-doc-results

Preview • Updated 24 days ago • 13

yuhangzang

updated a Space 24 days ago

MMLongBench Doc

A long-context, multimodal document understanding benchmark

yuhangzang

authored a paper about 2 months ago

ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning

Paper • 2512.05111 • Published Dec 4, 2025 • 49

yuhangzang

authored 2 papers 2 months ago

LSVOS 2025 Challenge Report: Recent Advances in Complex Video Object Segmentation

Paper • 2510.11063 • Published Oct 13, 2025 • 1

Think Visually, Reason Textually: Vision-Language Synergy in ARC

Paper • 2511.15703 • Published Nov 19, 2025 • 9

yuhangzang

authored a paper 3 months ago

Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning

Paper • 2510.27606 • Published Oct 31, 2025 • 29

myownskyW7

authored 5 papers 3 months ago

Beyond Fixed: Variable-Length Denoising for Diffusion Large Language Models

Paper • 2508.00819 • Published Aug 1, 2025 • 63

Adaptive Fast-and-Slow Visual Program Reasoning for Long-Form VideoQA

Paper • 2509.17743 • Published Sep 22, 2025

$\text{G}^2$RPO: Granular GRPO for Precise Reward in Flow Models

Paper • 2510.01982 • Published Oct 2, 2025 • 7

CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers

Paper • 2305.17455 • Published May 27, 2023

UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation

Paper • 2510.18701 • Published Oct 21, 2025 • 67

yuhangzang

authored a paper 3 months ago

STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence

Paper • 2510.24693 • Published Oct 28, 2025 • 19

myownskyW7

authored a paper 3 months ago

STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence

Paper • 2510.24693 • Published Oct 28, 2025 • 19

yuhangzang

authored a paper 3 months ago

UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation

Paper • 2510.18701 • Published Oct 21, 2025 • 67

yuhangzang

authored a paper 4 months ago

$\text{G}^2$RPO: Granular GRPO for Precise Reward in Flow Models

Paper • 2510.01982 • Published Oct 2, 2025 • 7

myownskyW7

authored 4 papers 4 months ago

CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning

Paper • 2508.20096 • Published Aug 27, 2025 • 37

Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Paper • 2508.20751 • Published Aug 28, 2025 • 89

Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Paper • 2508.20751 • Published Aug 28, 2025 • 89

SIM-CoT: Supervised Implicit Chain-of-Thought

Paper • 2509.20317 • Published Sep 24, 2025 • 42

yuhangzang

authored a paper 4 months ago

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Paper • 2509.22186 • Published Sep 26, 2025 • 142