Yassine Boukhari

Yasbok

AI & ML interests

NLP, Generative models, Reinforcement Learning

Recent Activity

upvoted an article about 2 months ago

Building the Hugging Face MCP Server

upvoted an article about 2 months ago

Tiny Agents in Python: a MCP-powered agent in ~70 lines of code

upvoted an article about 2 months ago

Transformers backend integration in SGLang

Organizations

upvoted 4 articles about 2 months ago

Building the Hugging Face MCP Server

Article • and 3 others • Jul 10 • 60

Tiny Agents in Python: a MCP-powered agent in ~70 lines of code

Article • and 3 others • May 23 • 158

Transformers backend integration in SGLang

Article • and 4 others • Jun 23 • 53

SmolLM3: smol, multilingual, long-context reasoner

Article • and 22 others • Jul 8 • 638

upvoted an article 2 months ago

StarCoder: A State-of-the-Art LLM for Code

Article • and 1 other • May 4, 2023 • 62

upvoted an article 3 months ago

You could have designed state of the art positional encoding

Article • Nov 25, 2024 • 349

upvoted a paper 4 months ago

100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models

Paper • 2505.00551 • Published May 1 • 37

upvoted an article 5 months ago

Welcome Llama 4 Maverick & Scout on Hugging Face!

Article • and 6 others • Apr 5 • 146

upvoted a paper 6 months ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published Feb 10 • 154

upvoted 5 articles 7 months ago

Open-source DeepResearch – Freeing our search agents

Article • and 4 others • Feb 4 • 1.29k

Open-R1: Update #1

Article • and 7 others • Feb 2 • 305

Open-R1: a fully open reproduction of DeepSeek-R1

Article • and 2 others • Jan 28 • 878

Assisted Generation: a new direction toward low-latency text generation

Article • May 11, 2023 • 71

Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference

Article • and 1 other • Jan 16 • 75

upvoted a paper 8 months ago

Agentless: Demystifying LLM-based Software Engineering Agents

Paper • 2407.01489 • Published Jul 1, 2024 • 63

upvoted an article 9 months ago

Use Models from the Hugging Face Hub in LM Studio

Article • Nov 28, 2024 • 140

upvoted 2 articles about 1 year ago

Unlocking Longer Generation with Key-Value Cache Quantization

Article • May 16, 2024 • 50

🪆 Introduction to Matryoshka Embedding Models

Article • and 2 others • Feb 23, 2024 • 157

upvoted 2 articles over 1 year ago

Mergoo: Efficiently Build Your Own MoE LLM

Article • Jun 3, 2024 • 48

Mixture of Depth is Vibe

Article • Apr 22, 2024 • 48
