Zengzhi Wang's picture

Zengzhi Wang

SinclairWang

·

https://tinyurl.com/zengzhi-homepage

AI & ML interests

Data Engineering for Generative AI

Recent Activity

upvoted a paper about 1 month ago

LatentUM: Unleashing the Potential of Interleaved Cross-Modal Reasoning via a Latent-Space Unified Model

upvoted a paper about 2 months ago

Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model

upvoted a collection about 2 months ago

View all activity

Organizations

upvoted a paper about 1 month ago

LatentUM: Unleashing the Potential of Interleaved Cross-Modal Reasoning via a Latent-Space Unified Model

Paper • 2604.02097 • Published Apr 2 • 32

upvoted a paper about 2 months ago

Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model

Paper • 2603.21986 • Published Mar 23 • 125

upvoted a collection about 2 months ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 10 items • Updated Mar 2 • 562

upvoted a paper about 2 months ago

daVinci-Env: Open SWE Environment Synthesis at Scale

Paper • 2603.13023 • Published Mar 13 • 30

upvoted a paper 2 months ago

Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders

Paper • 2603.06569 • Published Mar 6 • 119

upvoted a paper 4 months ago

LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation

Paper • 2512.23576 • Published Dec 29, 2025 • 66

upvoted 2 papers 5 months ago

LLaDA2.0: Scaling Up Diffusion Language Models to 100B

Paper • 2512.15745 • Published Dec 10, 2025 • 88

Exploration v.s. Exploitation: Rethinking RLVR through Clipping, Entropy, and Spurious Reward

Paper • 2512.16912 • Published Dec 18, 2025 • 13

upvoted 3 papers 6 months ago

GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization

Paper • 2511.15705 • Published Nov 19, 2025 • 98

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Paper • 2510.25726 • Published Oct 29, 2025 • 46

Context Engineering 2.0: The Context of Context Engineering

Paper • 2510.26493 • Published Oct 30, 2025 • 9

upvoted 3 papers 7 months ago

DeepSeek-OCR: Contexts Optical Compression

Paper • 2510.18234 • Published Oct 21, 2025 • 93

olmOCR 2: Unit Test Rewards for Document OCR

Paper • 2510.19817 • Published Oct 22, 2025 • 16

FineVision: Open Data Is All You Need

Paper • 2510.17269 • Published Oct 20, 2025 • 80

upvoted an article 7 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

+21

Jul 8, 2025

•

773

upvoted a paper 8 months ago

LIMI: Less is More for Agency

Paper • 2509.17567 • Published Sep 22, 2025 • 104

upvoted a paper 9 months ago

We-Math 2.0: A Versatile MathBook System for Incentivizing Visual Mathematical Reasoning

Paper • 2508.10433 • Published Aug 14, 2025 • 146

upvoted 3 collections 9 months ago

ProX General Models

base models trained on ProX curated data. • 7 items • Updated Mar 2 • 1

ProX Math Models

base models trained on ProX curated openwebmath-pro. • 5 items • Updated Oct 10, 2024 • 1

ProX Refining Models

Adapted small language models used to generate data refining programs • 5 items • Updated Oct 10, 2024 • 5