Thank you so much for this great article!
It lays out all the concepts in a very simple manner!
But I have one question, or rather an analogy, to ask:
When we talk about KL divergence, we compare the distributions as a whole, not some pointwise distance between vectors or the like.
With forward KLD, the mode/peak-covering behaviour stands out. So during quantization, when you say we keep the quantized model close to the original one, are we relying on that mode-covering behaviour?
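To make my question concrete, here is a minimal sketch of what I mean by "the distribution as a whole": forward KL between the original model's next-token distribution P and the quantized model's Q, which penalizes Q wherever P has mass. The logits here are just placeholders, not from any actual model.

```python
import torch
import torch.nn.functional as F

# Placeholder logits over the vocabulary for the same input
# (in practice these would come from the full-precision and quantized models).
logits_orig = torch.randn(1, 32000)    # original (full-precision) model
logits_quant = torch.randn(1, 32000)   # quantized model

p = F.softmax(logits_orig, dim=-1)            # reference distribution P
log_q = F.log_softmax(logits_quant, dim=-1)   # log Q from the quantized model

# Forward KL: D_KL(P || Q) = sum_x P(x) * (log P(x) - log Q(x)).
# Minimizing this pushes Q to put probability everywhere P does,
# which is the covering behaviour I'm asking about.
forward_kl = (p * (p.log() - log_q)).sum(dim=-1)
print(forward_kl)
```

Is this roughly the quantity the article has in mind when it says the quantized model should preserve the original model's distribution?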
