Naman Anand

naman5a

1 42 12

AI & ML interests

RAG , LLMs

Recent Activity

upvoted an article about 1 month ago

Migrating Your GitHub CI to Hugging Face Jobs

upvoted an article about 2 months ago

Training and Finetuning Embedding Models with Sentence Transformers

upvoted an article 2 months ago

Welcome Gemma 4: Frontier multimodal intelligence on device

View all activity

Organizations

upvoted an article about 1 month ago

Article

Migrating Your GitHub CI to Hugging Face Jobs

abidlabs

•

Jun 9

• 11

upvoted an article about 2 months ago

Article

Training and Finetuning Embedding Models with Sentence Transformers

tomaarsen

•

May 28, 2024

• 275

upvoted an article 2 months ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift

•

Apr 2

• 914

upvoted an article 4 months ago

Article

Community Evals: Because we're done trusting black-box leaderboards over the community

burtenshaw, SaylorTwift, kramp, merve, davanstrien, nielsr, julien-c

•

Feb 4

• 90

upvoted an article 6 months ago

Article

🪆 Introduction to Matryoshka Embedding Models

tomaarsen, Xenova, osanseviero

•

Feb 23, 2024

• 211

upvoted an article 7 months ago

Article

The Optimal Architecture for Small Language Models

codelion

•

Dec 26, 2025

• 121

upvoted a paper 7 months ago

T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground

Paper • 2512.10430 • Published Dec 11, 2025 • 121

upvoted 2 articles 7 months ago

Article

Automatic Prompt Optimization with DSPy and Cross Encoders

dleemiller

•

Aug 2, 2025

• 5

Article

We Got Claude to Fine-Tune an Open Source LLM

burtenshaw, evalstate

•

Dec 4, 2025

• 630

upvoted 3 articles 8 months ago

Article

Continuous batching from first principles

ror, ArthurZ, mcpotato

•

Nov 25, 2025

• 422

Article

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

toslali-ibm, mirinflim, qgallouedec, esnible, rganti, mudhakar

•

Jun 3, 2025

• 101

Article

20x Faster TRL Fine-tuning with RapidFire AI

kbigdelysh, arunkk09, qgallouedec

•

Nov 21, 2025

• 27

upvoted a collection 11 months ago

InternVL3.5

Collection

This collection includes all released checkpoints of InternVL3.5, covering different training stages (e.g., Pretraining, SFT, MPO, Cascade RL). • 45 items • Updated Mar 2 • 110

upvoted a paper 11 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 265

upvoted 4 articles about 1 year ago

Article

How to train a new language model from scratch using Transformers and Tokenizers

julien-c

•

Feb 14, 2020

• 62

Article

Introducing HELMET: Holistically Evaluating Long-context Language Models

hyen, gaotianyu1350, houminmin, kding1, danf, moshew, cdq10131

•

Apr 16, 2025

• 42

Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

danaaubakirova, Molbap, mshukor, cadene

•

Feb 4, 2025

• 192

Article

Finally, a Replacement for BERT: Introducing ModernBERT

bwarner, NohTow, bclavie, orionweller, ohallstrom, staghado, alexisgallagher, rbiswasfc, fladhak, tomaarsen, ncoop57, griffin, jph00, johnowhitaker, iacolippo

•

Dec 19, 2024

• 748

upvoted a paper about 1 year ago

SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents

Paper • 2505.20411 • Published May 26, 2025 • 97

upvoted a collection about 1 year ago

GLM-4-0414

Collection

GLM-4-0414 series model • 6 items • Updated Mar 2 • 135

Naman Anand

AI & ML interests

Recent Activity

Organizations

naman5a's activity

Migrating Your GitHub CI to Hugging Face Jobs

Training and Finetuning Embedding Models with Sentence Transformers

Welcome Gemma 4: Frontier multimodal intelligence on device

Community Evals: Because we're done trusting black-box leaderboards over the community

🪆 Introduction to Matryoshka Embedding Models

The Optimal Architecture for Small Language Models

Automatic Prompt Optimization with DSPy and Cross Encoders

We Got Claude to Fine-Tune an Open Source LLM

Continuous batching from first principles

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

20x Faster TRL Fine-tuning with RapidFire AI

How to train a new language model from scratch using Transformers and Tokenizers

Introducing HELMET: Holistically Evaluating Long-context Language Models

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

Finally, a Replacement for BERT: Introducing ModernBERT