Saining Xie

sainx · sainingxie

Recent Activity

upvoted a paper 3 days ago

Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders

Paper • 2601.16208 • Published 3 days ago • 46

authored a paper 9 days ago

Transition Matching Distillation for Fast Video Generation

Paper • 2601.09881 • Published 11 days ago • 31

upvoted a paper about 2 months ago

Flow Map Distillation Without Data

Paper • 2511.19428 • Published Nov 24, 2025 • 5

authored 17 papers 2 months ago

Deeply-Supervised Nets

Paper • 1409.5185 • Published Sep 18, 2014

Holistically-Nested Edge Detection

Paper • 1504.06375 • Published Apr 24, 2015 • 1

Aggregated Residual Transformations for Deep Neural Networks

Paper • 1611.05431 • Published Nov 16, 2016 • 2

Demystifying CLIP Data

Paper • 2309.16671 • Published Sep 28, 2023 • 20

Sample-Efficient Neural Architecture Search by Learning Action Space

Paper • 1906.06832 • Published Jun 17, 2019

Momentum Contrast for Unsupervised Visual Representation Learning

Paper • 1911.05722 • Published Nov 13, 2019 • 2

Going Denser with Open-Vocabulary Part Segmentation

Paper • 2305.11173 • Published May 18, 2023 • 2

Image Sculpting: Precise Object Editing with 3D Geometry Control

Paper • 2401.01702 • Published Jan 2, 2024 • 20

Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs

Paper • 2401.06209 • Published Jan 11, 2024

Masked Autoencoders Are Scalable Vision Learners

Paper • 2111.06377 • Published Nov 11, 2021 • 6

V-IRL: Grounding Virtual Intelligence in Real Life

Paper • 2402.03310 • Published Feb 5, 2024 • 16

Masked Feature Prediction for Self-Supervised Visual Pre-Training

Paper • 2112.09133 • Published Dec 16, 2021

SLIP: Self-supervision meets Language-Image Pre-training

Paper • 2112.12750 • Published Dec 23, 2021 • 1

A ConvNet for the 2020s

Paper • 2201.03545 • Published Jan 10, 2022 • 2

Scalable Diffusion Models with Transformers

Paper • 2212.09748 • Published Dec 19, 2022 • 18

ConvNeXt V2: Co-designing and Scaling ConvNets with Masked Autoencoders

Paper • 2301.00808 • Published Jan 2, 2023

CiT: Curation in Training for Effective Vision-Language Data

Paper • 2301.02241 • Published Jan 5, 2023