Hugging Face
Olivier
oliviermills
7 followers · 26 following
https://oliviermills.com
millsit
oliviermills
AI & ML interests
LLMs, Data, AI for non-profits
Recent Activity
liked a dataset about 21 hours ago
ministere-culture/comparia-conversations
commented on a paper about 1 month ago
Beyond Release: Access Considerations for Generative AI Systems
reacted to lewtun's post with 🔥 4 months ago
We are reproducing the full DeepSeek R1 data and training pipeline so everybody can use their recipe. Instead of doing it in secret we can do it together in the open! 🧪 Step 1: replicate the R1-Distill models by distilling a high-quality reasoning corpus from DeepSeek-R1. 🧠 Step 2: replicate the pure RL pipeline that DeepSeek used to create R1-Zero. This will involve curating new, large-scale datasets for math, reasoning, and code. 🔥 Step 3: show we can go from base model -> SFT -> RL via multi-stage training. Follow along: https://github.com/huggingface/open-r1
Organizations
oliviermills's activity
liked a dataset about 21 hours ago
ministere-culture/comparia-conversations • Viewer • Updated 4 days ago • 175k • 226 • 39
liked a dataset 11 months ago
Salesforce/xlam-function-calling-60k • Viewer • Updated Jan 24 • 60k • 4.28k • 454
liked a model 11 months ago
numind/NuExtract-large • Text Generation • Updated Jun 28, 2024 • 40 • 119
liked 6 models about 1 year ago
tomasonjo/text2cypher-demo-16bit • Text Generation • Updated May 17, 2024 • 12 • 24
jetmoe/jetmoe-8b • Text Generation • Updated Apr 15, 2024 • 1.09k • 246
mPLUG/DocOwl1.5 • Updated Apr 10, 2024 • 33 • 26
mPLUG/DocOwl1.5-stage1 • Updated Apr 10, 2024 • 1 • 11
mPLUG/DocOwl1.5-Chat • Updated Apr 10, 2024 • 54 • 27
CohereLabs/aya-101 • Text2Text Generation • Updated Apr 15 • 26.2k • 642
liked 10 models over 1 year ago
togethercomputer/StripedHyena-Nous-7B • Text Generation • Updated Mar 27, 2024 • 144 • 142
DiscoResearch/mixtral-7b-8expert • Text Generation • Updated Dec 11, 2023 • 9.44k • 264
microsoft/phi-2 • Text Generation • Updated Apr 29, 2024 • 624k • 3.34k
mistralai/Mixtral-8x7B-Instruct-v0.1 • Text Generation • Updated Aug 19, 2024 • 382k • 4.44k
DiscoResearch/DiscoLM-mixtral-8x7b-v2 • Text Generation • Updated Dec 13, 2023 • 59 • 124
Nexusflow/NexusRaven-V2-13B • Text Generation • Updated 23 days ago • 3.91k • 465
01-ai/Yi-34B • Text Generation • Updated Nov 11, 2024 • 5.65k • 1.3k
01-ai/Yi-6B • Text Generation • Updated Nov 11, 2024 • 12.6k • 372
HuggingFaceH4/zephyr-7b-alpha • Text Generation • Updated Oct 16, 2024 • 12.5k • 1.11k
Deci/DeciLM-6b • Text Generation • Updated Jul 29, 2024 • 105 • 232