Sayak Paul's picture

Sayak Paul

sayakpaul

·

https://sayak.dev

AI & ML interests

Diffusion models, representation learning

Recent Activity

upvoted an article 2 days ago

Building the Hugging Face MCP Server

upvoted an article 2 days ago

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

updated a Space 4 days ago

sayakpaul/serialize-flux-aot

View all activity

Organizations

upvoted 2 articles 2 days ago

Article

Building the Hugging Face MCP Server

By

and 3 others •

2 days ago

• 33

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

By

and 1 other •

3 days ago

• 496

upvoted a paper 10 days ago

Radial Attention: O(nlog n) Sparse Attention with Energy Decay for Long Video Generation

Paper • 2506.19852 • Published 18 days ago • 38

upvoted an article 20 days ago

Article

(LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware

By

and 4 others •

23 days ago

• 75

upvoted an article about 1 month ago

Article

Mixture of Experts Explained

By

and 5 others •

Dec 11, 2023

• 724

upvoted 2 articles about 2 months ago

Article

How to train a new language model from scratch using Transformers and Tokenizers

By

•

Feb 14, 2020

• 41

Article

Exploring Quantization Backends in Diffusers

By

and 2 others •

May 21

• 38

upvoted an article 2 months ago

Article

Welcoming Llama Guard 4 on Hugging Face Hub

By

and 3 others •

Apr 29

• 38

upvoted 2 articles 3 months ago

Article

CinePile 2.0 - making stronger datasets with adversarial refinement

By

and 3 others •

Oct 23, 2024

• 16

Article

Welcome Llama 4 Maverick & Scout on Hugging Face!

By

and 6 others •

Apr 5

• 145

upvoted 2 papers 3 months ago

A Refined Analysis of Massive Activations in LLMs

Paper • 2503.22329 • Published Mar 28 • 14

ZClip: Adaptive Spike Mitigation for LLM Pre-Training

Paper • 2504.02507 • Published Apr 3 • 90

upvoted a collection 4 months ago

SANA-1.5

SANA-1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer • 6 items • Updated Apr 17 • 6

upvoted 3 articles 4 months ago

Article

Don't repeat yourself - 🤗 Transformers Design Philosophy

By

•

Apr 5, 2022

• 36

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

By

and 3 others •

Mar 12

• 445

Article

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

By

and 2 others •

Jun 24, 2024

• 197

upvoted an article 5 months ago

Article

Distilling from Dialogues: Finding Meaning in LLM Interactions

By

•

Feb 25

• 4

upvoted a collection 5 months ago

Remote VAE Inference Endpoints

Models and handler code used in https://huggingface.co/blog/remote_vae • 5 items • Updated Mar 10 • 6

upvoted 2 articles 5 months ago

Article

Remote VAEs for decoding with HF endpoints 🤗

By

and 1 other •

Feb 24

• 40

Article

SigLIP 2: A better multilingual vision language encoder

By

and 2 others •

Feb 21

• 174