Zhengzhong Tu

vztu

https://vztu.github.io

_vztu
vztu

AI & ML interests

Generative AI, Multimodal AI, Trustworthy AI

Recent Activity

liked a Space 23 days ago

peiranli0930/CVAgentArena

upvoted 3 papers about 1 month ago

4KAgent: Agentic Any Image to 4K Super-Resolution

Paper • 2507.07105 • Published Jul 9 • 98

MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding

Paper • 2507.12463 • Published Jul 16 • 26

A Survey on Long-Video Storytelling Generation: Architectures, Consistency, and Cinematic Quality

Paper • 2507.07202 • Published Jul 9 • 22

upvoted a paper about 2 months ago

Demystifying the Visual Quality Paradox in Multimodal Large Language Models

Paper • 2506.15645 • Published Jun 18 • 4

upvoted a paper 2 months ago

SAFEFLOW: A Principled Protocol for Trustworthy and Transactional Autonomous Agent Systems

Paper • 2506.07564 • Published Jun 9 • 6

upvoted 3 papers 3 months ago

Edit Away and My Face Will not Stay: Personal Biometric Defense against Malicious Generative Editing

Paper • 2411.16832 • Published Nov 25, 2024 • 2

DINO-R1: Incentivizing Reasoning Capability in Vision Foundation Models

Paper • 2505.24025 • Published May 29 • 27

MoDoMoDo: Multi-Domain Data Mixtures for Multimodal LLM Reinforcement Learning

Paper • 2505.24871 • Published May 30 • 22

upvoted a paper 5 months ago

Impossible Videos

Paper • 2503.14378 • Published Mar 18 • 62

upvoted a paper 6 months ago

On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective

Paper • 2502.14296 • Published Feb 20 • 46

upvoted a paper 11 months ago

LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness

Paper • 2409.18125 • Published Sep 26, 2024 • 35

upvoted 2 papers over 1 year ago

TIP: Text-Driven Image Processing with Semantic and Restoration Instructions

Paper • 2312.11595 • Published Dec 18, 2023 • 6

Conditional Diffusion Distillation

Paper • 2310.01407 • Published Oct 2, 2023 • 20