Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
13
2
236
Leanne
Twinwaffle
Follow
Mi6paulino's profile picture
asigalov61's profile picture
2 followers
·
8 following
AI & ML interests
None yet
Recent Activity
liked
a Space
4 days ago
Fabriwin/flux
reacted
to
burtenshaw
's
post
with 👍
4 days ago
I’m super excited to work with @mlabonne to build the first practical example in the reasoning course. 🔗 https://huggingface.co/reasoning-course Here's a quick walk through of the first drop of material that works toward the use case: - a fundamental introduction to reinforcement learning. Answering questions like, ‘what is a reward?’ and ‘how do we create an environment for a language model?’ - Then it focuses on Deepseek R1 by walking through the paper and highlighting key aspects. This is an old school way to learn ML topics, but it always works. - Next, it takes to you Transformers Reinforcement Learning and demonstrates potential reward functions you could use. This is cool because it uses Marimo notebooks to visualise the reward. - Finally, Maxime walks us through a real training notebook that uses GRPO to reduce generation length. I’m really into this because it works and Maxime took the time to validate it share assets and logging from his own runs for you to compare with. Maxime’s work and notebooks have been a major part of the open source community over the last few years. I, like everyone, have learnt so much from them.
liked
a Space
18 days ago
huchenchat/Shakker-Labs-SD3.5-LoRA-Chinese-Line-Art
View all activity
Organizations
spaces
1
pinned
Running
4
FLUX.1 Schnell Serverless
🔥
FLUX.1-Schnell on serverless inference, no GPU required
models
None public yet
datasets
None public yet