Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Olivier's picture
7 1 19

Olivier

oliviermills
Stevens's profile picture HyperAutomata's profile picture denisfitz's profile picture
ยท
https://oliviermills.com
  • millsit
  • oliviermills

AI & ML interests

LLMs, Data, AI for non-profits

Recent Activity

liked a dataset about 16 hours ago
ministere-culture/comparia-conversations
commented on a paper about 1 month ago
Beyond Release: Access Considerations for Generative AI Systems
reacted to lewtun's post with ๐Ÿ”ฅ 4 months ago
We are reproducing the full DeepSeek R1 data and training pipeline so everybody can use their recipe. Instead of doing it in secret we can do it together in the open! ๐Ÿงช Step 1: replicate the R1-Distill models by distilling a high-quality reasoning corpus from DeepSeek-R1. ๐Ÿง  Step 2: replicate the pure RL pipeline that DeepSeek used to create R1-Zero. This will involve curating new, large-scale datasets for math, reasoning, and code. ๐Ÿ”ฅ Step 3: show we can go from base model -> SFT -> RL via multi-stage training. Follow along: https://github.com/huggingface/open-r1
View all activity

Organizations

Baobab Tech's profile picture

models 0

None public yet

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs