Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1168
89
543
Lewis Tunstall
PRO
lewtun
Follow
logicaldata's profile picture
piecurus's profile picture
angle's profile picture
814 followers
·
0 following
https://lewtun.github.io/blog/
_lewtun
lewtun
AI & ML interests
LLMs, LLMs, LLMs
Recent Activity
liked
a dataset
about 3 hours ago
ServiceNow-AI/R1-Distill-SFT
liked
a dataset
about 3 hours ago
open-thoughts/OpenThoughts-114k
upvoted
an
article
about 4 hours ago
Welcome to Inference Providers on the Hub 🔥
View all activity
Articles
Open-R1: a fully open reproduction of DeepSeek-R1
about 23 hours ago
•
221
Universal Assisted Generation: Faster Decoding with Any Assistant Model
Oct 29, 2024
•
52
Faster Assisted Generation with Dynamic Speculation
Oct 8, 2024
•
44
Llama can now see and run on your device - welcome Llama 3.2
Sep 25, 2024
•
182
FineVideo: behind the scenes
Sep 23, 2024
•
28
How NuminaMath Won the 1st AIMO Progress Prize
Jul 11, 2024
•
111
Welcome Gemma 2 - Google's new open LLM
Jun 27, 2024
•
126
Constitutional AI with Open LLMs
Feb 1, 2024
•
13
Preference Tuning LLMs with Direct Preference Optimization Methods
Jan 18, 2024
•
43
Mixture of Experts Explained
Dec 11, 2023
•
270
Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face
Dec 11, 2023
•
11
SetFitABSA: Few-Shot Aspect Based Sentiment Analysis using SetFit
Dec 6, 2023
•
6
Fine-tuning Llama 2 70B using PyTorch FSDP
Sep 13, 2023
•
16
Code Llama: Llama 2 learns to code
Aug 25, 2023
•
9
Llama 2 is here - get it on Hugging Face
Jul 18, 2023
•
23
Can foundation models label data like humans?
Jun 12, 2023
•
1
The Falcon has landed in the Hugging Face ecosystem
Jun 5, 2023
•
12
Creating a Coding Assistant with StarCoder
May 9, 2023
•
1
StackLLaMA: A hands-on guide to train LLaMA with RLHF
Apr 5, 2023
•
26
Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU
Mar 9, 2023
•
37
Red-Teaming Large Language Models
Feb 24, 2023
•
21
Diffusion Models Live Event
Nov 25, 2022
Very Large Language Models and How to Evaluate Them
Oct 3, 2022
•
1
SetFit: Efficient Few-Shot Learning Without Prompts
Sep 26, 2022
•
22
Announcing Evaluation on the Hub
Jun 28, 2022
Organizations
lewtun
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
2 datasets
about 3 hours ago
ServiceNow-AI/R1-Distill-SFT
Viewer
•
Updated
about 2 hours ago
•
1.85M
•
50
•
35
open-thoughts/OpenThoughts-114k
Viewer
•
Updated
about 5 hours ago
•
114k
•
29
liked
2 models
4 days ago
RLHFlow/Llama3.1-8B-PRM-Deepseek-Data
Text Generation
•
Updated
Nov 9, 2024
•
21.9k
•
33
meta-llama/Llama-3.2-1B-Instruct
Text Generation
•
Updated
Oct 24, 2024
•
1.24M
•
722
liked
2 models
5 days ago
HuggingFaceTB/SmolVLM-256M-Instruct
Image-Text-to-Text
•
Updated
5 days ago
•
7.27k
•
91
HuggingFaceTB/SmolVLM-500M-Instruct
Image-Text-to-Text
•
Updated
5 days ago
•
5.48k
•
74
liked
2 models
7 days ago
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
Text Generation
•
Updated
3 days ago
•
150k
•
517
deepseek-ai/DeepSeek-R1
Text Generation
•
Updated
3 days ago
•
189k
•
4.41k
liked
a Space
22 days ago
Running
34
🥇
MEGA-Bench
A leaderboard for multimodal models
liked
a model
22 days ago
HuggingFaceTB/FineMath-Llama-3B
Updated
22 days ago
•
218
•
13
liked
a dataset
23 days ago
HuggingFaceH4/MATH-500
Viewer
•
Updated
Nov 15, 2024
•
500
•
13k
•
53
liked
a model
23 days ago
deepseek-ai/DeepSeek-V3
Text Generation
•
Updated
4 days ago
•
325k
•
2.63k
liked
a model
26 days ago
Skywork/Skywork-o1-Open-PRM-Qwen-2.5-1.5B
Text Classification
•
Updated
Nov 27, 2024
•
1.32k
•
24
liked
a Space
28 days ago
Running
429
📈
2024 AI Timeline
liked
a model
30 days ago
deepseek-ai/DeepSeek-V3-Base
Updated
5 days ago
•
21.8k
•
1.42k
liked
a Space
about 1 month ago
Running
239
🏃
Jupyter Agent
liked
2 models
about 1 month ago
answerdotai/ModernBERT-large
Fill-Mask
•
Updated
13 days ago
•
2.04M
•
335
answerdotai/ModernBERT-base
Fill-Mask
•
Updated
13 days ago
•
4.78M
•
713
liked
a dataset
about 1 month ago
HuggingFaceTB/finemath
Viewer
•
Updated
Dec 23, 2024
•
48.3M
•
30.8k
•
269
liked
a Space
about 1 month ago
Running
486
📈
Scaling test-time compute
Load more