Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
81
10
15
Guilherme Penedo
guipenedo
Follow
Neon720's profile picture
Flair-ai's profile picture
Jikand's profile picture
710 followers
·
6 following
gui_penedo
guipenedo
AI & ML interests
None yet
Articles
FineWeb2-C: Help Build Better Language Models in Your Language
3 days ago
•
10
Organizations
guipenedo
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
a dataset
18 days ago
HuggingFaceFW/fineweb-2
Viewer
•
Updated
18 days ago
•
13.8B
•
86.4k
•
361
liked
a Space
30 days ago
Running
32
💬
Discussion Forum
liked
a model
about 2 months ago
HuggingFaceTB/SmolLM2-1.7B-Instruct
Text Generation
•
Updated
22 days ago
•
94.1k
•
443
liked
a Space
2 months ago
Running
47
📝
Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks
liked
a Space
3 months ago
Running
93
📖
TxT360: Trillion Extracted Text
liked
a model
3 months ago
cis-lmu/glotlid
Text Classification
•
Updated
Oct 26
•
8.14k
•
51
liked
a dataset
3 months ago
tiiuae/falcon-refinedweb
Viewer
•
Updated
Jun 20, 2023
•
968M
•
34.7k
•
826
liked
a Space
5 months ago
Running
362
🧽
Finegrain Object Eraser
Erase any object just by naming it!
liked
2 models
5 months ago
HuggingFaceTB/SmolLM-1.7B
Text Generation
•
Updated
Oct 16
•
10.2k
•
164
HuggingFaceTB/SmolLM-1.7B-Instruct
Text Generation
•
Updated
Aug 18
•
44.8k
•
107
liked
a model
6 months ago
AI-MO/NuminaMath-7B-TIR
Text Generation
•
Updated
Aug 14
•
2.85k
•
321
liked
a dataset
7 months ago
HuggingFaceFW/fineweb-edu
Viewer
•
Updated
6 days ago
•
3B
•
329k
•
571
liked
a Space
7 months ago
Running
548
🍷
FineWeb: decanting the web for the finest text data at scale
liked
a dataset
8 months ago
HuggingFaceFW/fineweb
Viewer
•
Updated
7 days ago
•
48.4B
•
245k
•
1.78k
liked
a Space
about 1 year ago
Running
206
🚀
GPT Baker