Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
daqc
's Collections
Dataset Best Practices
LRMs
Agents
Thinkers
Low-Resource Data
Reasoning LLMs
Multilingual
Read later
SLMs
Safety
Reinforcement
on-Device (phone)
Frameworks
Domain-specific
Dataset Best Practices
updated
Jan 27
Upvote
-
Towards Best Practices for Open Datasets for LLM Training
Paper
•
2501.08365
•
Published
Jan 14
•
56
Upvote
-
Share collection
View history
Collection guide
Browse collections