Zhaocheng Liu

zhaocheng

https://scholar.google.com/citations?user=Kk-dRIAAAAAJ

Recent Activity

liked a model 1 day ago

BadToBest/EchoMimicV2

liked a model 13 days ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

liked a dataset 14 days ago

deepmind/code_contests

zhaocheng's activity

upvoted 3 papers 21 days ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published 26 days ago • 106

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published 23 days ago • 54

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Paper • 2502.01061 • Published 24 days ago • 183

upvoted 6 papers about 1 month ago

Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step

Paper • 2501.13926 • Published Jan 23 • 37

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 332

upvoted 3 papers 6 months ago

LLaMA-Omni: Seamless Speech Interaction with Large Language Models

Paper • 2409.06666 • Published Sep 10, 2024 • 56

Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale

Paper • 2409.08264 • Published Sep 12, 2024 • 45

Automated Design of Agentic Systems

Paper • 2408.08435 • Published Aug 15, 2024 • 39

upvoted 6 papers 7 months ago

Transformer Explainer: Interactive Learning of Text-Generative Models

Paper • 2408.04619 • Published Aug 8, 2024 • 159

MiniCPM-V: A GPT-4V Level MLLM on Your Phone

Paper • 2408.01800 • Published Aug 3, 2024 • 81

CoD, Towards an Interpretable Medical Agent using Chain of Diagnosis

Paper • 2407.13301 • Published Jul 18, 2024 • 56

OutfitAnyone: Ultra-high Quality Virtual Try-On for Any Clothing and Any Person

Paper • 2407.16224 • Published Jul 23, 2024 • 27

EVLM: An Efficient Vision-Language Model for Visual Understanding

Paper • 2407.14177 • Published Jul 19, 2024 • 43

Visual Text Generation in the Wild

Paper • 2407.14138 • Published Jul 19, 2024 • 9

upvoted 2 papers 8 months ago

Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model

Paper • 2407.07053 • Published Jul 9, 2024 • 44

An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels

Paper • 2406.09415 • Published Jun 13, 2024 • 51