5 18 4

Xiyao Wang

russwang

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

upvoted a paper 7 days ago

ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation

updated a dataset 9 days ago

russwang/LLaVA-Critic-GRPO-shortprompt

View all activity

Organizations

upvoted a paper 2 days ago

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Paper • 2506.24119 • Published 3 days ago • 35

upvoted a paper 7 days ago

ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation

Paper • 2506.18095 • Published 11 days ago • 63

upvoted a paper 17 days ago

ViCrit: A Verifiable Reinforcement Learning Proxy Task for Visual Perception in VLMs

Paper • 2506.10128 • Published 22 days ago • 22

upvoted a paper 24 days ago

MORSE-500: A Programmatically Controllable Video Benchmark to Stress-Test Multimodal Reasoning

Paper • 2506.05523 • Published 28 days ago • 33

upvoted a paper about 1 month ago

Beyond Markovian: Reflective Exploration via Bayes-Adaptive RL for LLM Reasoning

Paper • 2505.20561 • Published May 26 • 7

upvoted a paper about 2 months ago

Skill Discovery for Software Scripting Automation via Offline Simulations with LLMs

Paper • 2504.20406 • Published Apr 29 • 7

upvoted a paper 3 months ago

OpenVLThinker: An Early Exploration to Complex Vision-Language Reasoning via Iterative Self-Improvement

Paper • 2503.17352 • Published Mar 21 • 23

upvoted a collection 3 months ago

ThinkLite-VL

Collection

5 items • Updated May 18 • 2

upvoted a paper 3 months ago

SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement

Paper • 2504.07934 • Published Apr 10 • 19

upvoted a collection 5 months ago

Qwen2.5-Math

Collection

Math-specific model series based on Qwen2.5 • 11 items • Updated Apr 28 • 82

upvoted a paper 6 months ago

Offline Reinforcement Learning for LLM Multi-Step Reasoning

Paper • 2412.16145 • Published Dec 20, 2024 • 39

upvoted 4 papers 7 months ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 159

Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion

Paper • 2412.04424 • Published Dec 5, 2024 • 64

Scaling Inference-Time Search with Vision Value Model for Improved Visual Comprehension

Paper • 2412.03704 • Published Dec 4, 2024 • 7

SNOOPI: Supercharged One-step Diffusion Distillation with Proper Guidance

Paper • 2412.02687 • Published Dec 3, 2024 • 114

upvoted a paper 9 months ago

LLaVA-Critic: Learning to Evaluate Multimodal Models

Paper • 2410.02712 • Published Oct 3, 2024 • 38

upvoted 2 papers about 1 year ago

LiteSearch: Efficacious Tree Search for LLM

Paper • 2407.00320 • Published Jun 29, 2024 • 40

VoCo-LLaMA: Towards Vision Compression with Large Language Models

Paper • 2406.12275 • Published Jun 18, 2024 • 32

Xiyao Wang

AI & ML interests

Recent Activity

Organizations

russwang's activity