sdtana

roxani_17

AI & ML interests

None yet

sdtana's activity

upvoted a paper 3 days ago

Group Downsampling with Equivariant Anti-aliasing

Paper • 2504.17258 • Published 9 days ago • 7

upvoted 2 papers 12 days ago

You Do Not Fully Utilize Transformer's Representation Capacity

Paper • 2502.09245 • Published Feb 13 • 38

SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference

Paper • 2502.18137 • Published Feb 25 • 57

upvoted a paper 13 days ago

Transformers without Normalization

Paper • 2503.10622 • Published Mar 13 • 161

upvoted 2 papers 14 days ago

Gaussian Mixture Flow Matching Models

Paper • 2504.05304 • Published 25 days ago • 12

REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers

Paper • 2504.10483 • Published 18 days ago • 20

upvoted a paper 17 days ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published 18 days ago • 251

upvoted a paper 19 days ago

PixelFlow: Pixel-Space Generative Models with Flow

Paper • 2504.07963 • Published 22 days ago • 19

upvoted an article 26 days ago

You could have designed state of the art positional encoding

Article • Nov 25, 2024 • 237

upvoted a paper about 1 month ago

Multi-Token Attention

Paper • 2504.00927 • Published Apr 1 • 49

upvoted a paper 2 months ago

Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think

Paper • 2502.20172 • Published Feb 27 • 28

upvoted 4 papers 3 months ago

TextAtlas5M: A Large-scale Dataset for Dense Text Image Generation

Paper • 2502.07870 • Published Feb 11 • 45

ConceptAttention: Diffusion Transformers Learn Highly Interpretable Features

Paper • 2502.04320 • Published Feb 6 • 38

LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer

Paper • 2502.01105 • Published Feb 3 • 20

SliderSpace: Decomposing the Visual Capabilities of Diffusion Models

Paper • 2502.01639 • Published Feb 3 • 25