Vaibhav Singh

veb-101

veb-101

AI & ML interests

None yet

Recent Activity

upvoted a paper 9 days ago

STream3R: Scalable Sequential 3D Reconstruction with Causal Transformer

upvoted a paper 12 days ago

MolmoAct: Action Reasoning Models that can Reason in Space

upvoted a paper 14 days ago

Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off

View all activity

Organizations

None yet

upvoted a paper 9 days ago

STream3R: Scalable Sequential 3D Reconstruction with Causal Transformer

Paper • 2508.10893 • Published 11 days ago • 30

upvoted a paper 12 days ago

MolmoAct: Action Reasoning Models that can Reason in Space

Paper • 2508.07917 • Published 14 days ago • 39

upvoted a paper 14 days ago

Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off

Paper • 2508.04825 • Published 19 days ago • 57

upvoted an article about 1 month ago

Article

Efficient MultiModal Data Pipeline

and 4 others •

Jul 8

• 53

upvoted a paper 2 months ago

GenRecal: Generation after Recalibration from Large to Small Vision-Language Models

Paper • 2506.15681 • Published Jun 18 • 40

upvoted an article 2 months ago

Article

🪆 Introduction to Matryoshka Embedding Models

and 2 others •

Feb 23, 2024

• 157

upvoted a paper 5 months ago

Personalize Anything for Free with Diffusion Transformer

Paper • 2503.12590 • Published Mar 16 • 44

upvoted a paper 6 months ago

The Curse of Depth in Large Language Models

Paper • 2502.05795 • Published Feb 9 • 41

upvoted a paper 9 months ago

Scaling Properties of Diffusion Models for Perceptual Tasks

Paper • 2411.08034 • Published Nov 12, 2024 • 13

upvoted a collection 10 months ago

Cosmos-Tokenizer

Collection

A suite of image and video tokenizers • 13 items • Updated 11 days ago • 41

upvoted a paper 11 months ago

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second

Paper • 2410.02073 • Published Oct 2, 2024 • 41

upvoted an article about 1 year ago

Article

Welcome FalconMamba: The first strong attention-free 7B model

and 5 others •

Aug 12, 2024

• 113

upvoted a collection about 1 year ago

MobileNetV4 pretrained weights

Collection

Weights for MobileNet-V4 pretrained in timm • 17 items • Updated 24 days ago • 19

upvoted 2 papers about 1 year ago

DiTFastAttn: Attention Compression for Diffusion Transformer Models

Paper • 2406.08552 • Published Jun 12, 2024 • 26

Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations

Paper • 2405.18392 • Published May 28, 2024 • 12

upvoted an article over 1 year ago

Article

MobileNet-V4 (now in timm)

•

Jun 17, 2024

• 48

upvoted 4 papers over 1 year ago

Vaibhav Singh

AI & ML interests

Recent Activity

Organizations

veb-101's activity

Efficient MultiModal Data Pipeline

🪆 Introduction to Matryoshka Embedding Models

Welcome FalconMamba: The first strong attention-free 7B model

MobileNet-V4 (now in timm)