34 96 228

dame rajee

damerajee

AI & ML interests

None yet

Recent Activity

upvoted an article about 20 hours ago

🪆 Introduction to Matryoshka Embedding Models

liked a Space 8 days ago

clem/deepseek-ai-DeepSeek-V3-0324

reacted to Kseniase's post with ❤️ 10 days ago

View all activity

Organizations

damerajee's activity

upvoted an article about 20 hours ago

Article

🪆 Introduction to Matryoshka Embedding Models

Feb 23, 2024

• 95

upvoted a paper 18 days ago

Words or Vision: Do Vision-Language Models Have Blind Faith in Text?

Paper • 2503.02199 • Published about 1 month ago • 8

upvoted a collection 20 days ago

BD3-LMs

Collection

https://m-arriola.com/bd3lms/ • 4 items • Updated 21 days ago • 20

upvoted 3 papers about 1 month ago

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published Feb 20 • 169

Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation

Paper • 2502.14846 • Published Feb 20 • 13

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16 • 150

upvoted a paper about 2 months ago

Matryoshka Quantization

Paper • 2502.06786 • Published Feb 10 • 30

upvoted an article about 2 months ago

Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

Feb 4

• 128

upvoted 2 papers 2 months ago

WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training

Paper • 2501.18511 • Published Jan 30 • 19

Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling

Paper • 2501.16975 • Published Jan 28 • 28

upvoted an article 3 months ago

Article

Timm ❤️ Transformers: Use any timm model with transformers

Jan 16

• 45

upvoted 6 papers 3 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 272

Scaling Laws for Floating Point Quantization Training

Paper • 2501.02423 • Published Jan 5 • 26

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4 • 98