Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
N's picture
1 1 3

N

gorillaframeai
ยท

AI & ML interests

None yet

Recent Activity

reacted to loubnabnl's post with โค๏ธ 2 days ago
We've just published a detailed blog post on the creation of Cosmopedia dataset. We hope this will provide insights about generating synthetic data at scale for pre-training. https://huggingface.co/blog/cosmopedia Here are some key takeaways: ๐ŸŽฏ Prompt curation is crucial: we want to cover many topics with few duplicates. ๐Ÿ“š You can leverage various resources for diversity: using different seed data, generation formats, and target audiences. โš™๏ธ The importance of a good technical stack: for scalable generations with tools like llm-swarm and fast model training and evaluation. Have a good read!
upvoted a collection about 2 months ago
HiDream-I1
liked a Space about 2 months ago
VisualCloze/VisualCloze
View all activity

Organizations

Vladimir's profile picture

gorillaframeai 's datasets

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs