Lewis Tunstall's picture

Lewis Tunstall PRO

lewtun

·

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Recent Activity

updated a Space about 1 hour ago

open-r1/open-r1-eval-leaderboard

upvoted an article about 23 hours ago

CodeAgents + Structure: A Better Way to Execute Actions

liked a model about 23 hours ago

deepseek-ai/DeepSeek-R1-0528

View all activity

Organizations

lewtun's activity

upvoted an article about 23 hours ago

Article

CodeAgents + Structure: A Better Way to Execute Actions

By

and 1 other •

May 28, 2024

• 21

upvoted a collection 3 days ago

Step 1: Reproducing DeepSeek's Distilled Models

Code for training and evaluation: https://github.com/huggingface/open-r1 • 3 items • Updated 3 days ago • 1

upvoted an article 8 days ago

Article

Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance

By

and 5 others •

8 days ago

• 25

upvoted an article 9 days ago

Article

NVIDIA Cosmos Now Available On Hugging Face For Physical AI Reasoning

By

and 1 other •

10 days ago

• 24

upvoted an article 12 days ago

Article

TinyAgents: A Minimal Experiment with Code Agents and MCP Tools

By

•

13 days ago

• 29

upvoted 2 articles 13 days ago

Article

The 4 Things Qwen-3's Chat Template Teaches Us

By

•

30 days ago

• 47

Article

The Transformers Library: standardizing model definitions

By

and 3 others •

15 days ago

• 104

upvoted a paper 16 days ago

INTELLECT-2: A Reasoning Model Trained Through Globally Decentralized Reinforcement Learning

Paper • 2505.07291 • Published 17 days ago • 11

upvoted a paper 17 days ago

Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement Learning

Paper • 2504.11354 • Published Apr 15 • 5

upvoted an article 21 days ago

Article

Page-to-Video: Generate videos from webpages 🪄🎬

By

•

23 days ago

• 27

upvoted an article 29 days ago

Article

How to Build an MCP Server with Gradio

By

and 1 other •

30 days ago

• 127

upvoted 2 articles about 1 month ago

Article

Tiny Agents: a MCP-powered agent in 50 lines of code

By

•

Apr 25

• 262

Article

From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate

By

and 3 others •

Jun 13, 2024

• 54

upvoted an article about 2 months ago

Article

Empowering Public Organizations: Preparing Your Data for the AI Era

By

and 1 other •

Apr 10

• 15

upvoted a collection about 2 months ago

Cogito v1 Preview

5 items • Updated Apr 8 • 111

upvoted 2 papers about 2 months ago

Leanabell-Prover: Posttraining Scaling in Formal Reasoning

Paper • 2504.06122 • Published Apr 8 • 6

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7 • 185

upvoted 3 articles about 2 months ago

Article

Enabling Long Context Training with Sequence Parallelism in Axolotl

By

and 1 other •

Apr 4

• 8

Article

Training Large Language Models with Interpreter Feedback using WebAssembly

By

and 1 other •

Apr 3

• 13

Article

Querying Hugging Face Datasets with the DuckDB UI

By

•

Apr 3

• 16