5 8 7

Vincent Luo

Vincent-luo

AI & ML interests

None yet

Recent Activity

upvoted an article about 1 month ago

Welcome Gemma 4: Frontier multimodal intelligence on device

upvoted an article about 1 month ago

How I contributed a new model to the Transformers library using Codex

upvoted an article 3 months ago

SigLIP 2: A better multilingual vision language encoder

View all activity

Organizations

upvoted 2 articles about 1 month ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift

•

Apr 2

• 890

Article

How I contributed a new model to the Transformers library using Codex

nielsr

•

Mar 30

• 51

upvoted an article 3 months ago

Article

SigLIP 2: A better multilingual vision language encoder

ariG23498, merve, qubvel-hf

•

Feb 21, 2025

• 211

upvoted 2 articles over 1 year ago

Article

LoRA training scripts of the world, unite!

linoyts, multimodalart

•

Jan 2, 2024

• 79

Article

Advanced Flux Dreambooth LoRA Training with 🧨 diffusers

linoyts

•

Oct 21, 2024

• 42

liked 4 Spaces over 1 year ago

Stable Diffusion XL on TPUv5e

🏋

2.04k

Generate images from text prompts

Vision Arena (Testing VLMs side-by-side)

🖼

561

Explore AI-powered visual tasks in Vision Arena

Vision Papers

💻

114

All paper summaries read by Merve

VBench Leaderboard

📊

351

Submit video model evaluation results to a public benchmark

upvoted an article almost 2 years ago

Article

Using LoRA for Efficient Stable Diffusion Fine-Tuning

pcuenq, sayakpaul

•

Jan 26, 2023

• 82

liked a Space about 2 years ago

IDEFICS2 Playground

🐨

169

Chat with a visual AI assistant using text and images

upvoted 2 articles about 2 years ago

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Leyo, HugoLaurencon, VictorSanh

•

Apr 15, 2024

• 191

Article

Vision Language Models Explained

merve, edbeeching

•

Apr 11, 2024

• 531

New activity in Vincent-luo/hagrid-mediapipe-hands almost 3 years ago

SD Auto1111

#1 opened almost 3 years ago by

Pomad

updated a dataset almost 3 years ago

Vincent-luo/hagrid-mediapipe-hands

Viewer • Updated May 26, 2023 • 507k • 762 • 13

updated a model almost 3 years ago

Vincent-luo/controlnet-hands

Image-to-Image • Updated May 26, 2023 • 71 • 4

New activity in Vincent-luo/controlnet-hands almost 3 years ago

Upload controlnet as PyTorch weights

#2 opened almost 3 years ago by

patrickvonplaten

New activity in Vincent-luo/controlnet-hands about 3 years ago

Improve README and enable inference API

#1 opened about 3 years ago by

patrickvonplaten

liked 2 Spaces about 3 years ago

ColorCanny Controlnet

🐨

Controlnet Hands

🏢

115

Vincent Luo

AI & ML interests

Recent Activity

Organizations

Vincent-luo's activity

Welcome Gemma 4: Frontier multimodal intelligence on device

How I contributed a new model to the Transformers library using Codex

SigLIP 2: A better multilingual vision language encoder

LoRA training scripts of the world, unite!

Advanced Flux Dreambooth LoRA Training with 🧨 diffusers

Stable Diffusion XL on TPUv5e

Vision Arena (Testing VLMs side-by-side)

Vision Papers

VBench Leaderboard

Using LoRA for Efficient Stable Diffusion Fine-Tuning

IDEFICS2 Playground

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Vision Language Models Explained

SD Auto1111

Upload controlnet as PyTorch weights

Improve README and enable inference API

ColorCanny Controlnet

Controlnet Hands