Yifan Jiang

YifanJ

·

AI & ML interests

None yet

Recent Activity

new activity 10 days ago

ChilleD/WebHarbor:Add target.tar.gz — rebuilt Target mirror assets

authored a paper about 1 month ago

MARVEL: Multidimensional Abstraction and Reasoning through Visual Evaluation and Learning

authored a paper about 1 month ago

The Curious Case of Nonverbal Abstract Reasoning with Multi-Modal Large Language Models

View all activity

Organizations

upvoted a paper about 1 month ago

AFFORDANCE20Q: Evaluating Affordance Reasoning from Physical Properties

Paper • 2606.14240 • Published Jun 12 • 5

upvoted 4 papers 4 months ago

FORGE:Fine-grained Multimodal Evaluation for Manufacturing Scenarios

Paper • 2604.07413 • Published Apr 8 • 98

SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning

Paper • 2603.23483 • Published Mar 24 • 62

Perceptio: Perception Enhanced Vision Language Models via Spatial Token Generation

Paper • 2603.18795 • Published Mar 19 • 17

MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification

Paper • 2603.15726 • Published Mar 16 • 187

upvoted 3 papers 6 months ago

VIDEOP2R: Video Understanding from Perception to Reasoning

Paper • 2511.11113 • Published Nov 14, 2025 • 113

MMDeepResearch-Bench: A Benchmark for Multimodal Deep Research Agents

Paper • 2601.12346 • Published Jan 18 • 52

Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning

Paper • 2601.06943 • Published Jan 11 • 215

upvoted a paper 8 months ago

ORION: Teaching Language Models to Reason Efficiently in the Language of Thought

Paper • 2511.22891 • Published Nov 28, 2025 • 8

upvoted a collection 8 months ago

VisionLM

1929 items • Updated May 25 • 151

upvoted 7 papers 8 months ago

Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation

Paper • 2511.14993 • Published Nov 19, 2025 • 235

MHR: Momentum Human Rig

Paper • 2511.15586 • Published Nov 19, 2025 • 14

FreeAskWorld: An Interactive and Closed-Loop Simulator for Human-Centric Embodied AI

Paper • 2511.13524 • Published Nov 17, 2025 • 7

Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks

Paper • 2511.15065 • Published Nov 19, 2025 • 78

GGBench: A Geometric Generative Reasoning Benchmark for Unified Multimodal Models

Paper • 2511.11134 • Published Nov 14, 2025 • 33

WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation

Paper • 2511.11434 • Published Nov 14, 2025 • 47

DoPE: Denoising Rotary Position Embedding

Paper • 2511.09146 • Published Nov 12, 2025 • 98

upvoted a paper 9 months ago

COLUMBUS: Evaluating COgnitive Lateral Understanding through Multiple-choice reBUSes

Paper • 2409.04053 • Published Sep 6, 2024 • 1

upvoted a collection about 1 year ago

DyCodeEval

DyCodeEval (ICML 2025) enables dynamic benchmarking for code LLMs. This collection features dynamic HumanEval and MBPP sets generated with Claude 3.5. • 3 items • Updated Jun 27, 2025 • 4

upvoted a paper about 1 year ago

Dynamic Benchmarking of Reasoning Capabilities in Code Large Language Models Under Data Contamination

Paper • 2503.04149 • Published Mar 6, 2025 • 6