Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
1
3
N
gorillaframeai
Follow
0 followers
ยท
2 following
AI & ML interests
None yet
Recent Activity
reacted
to
loubnabnl
's
post
with โค๏ธ
2 days ago
We've just published a detailed blog post on the creation of Cosmopedia dataset. We hope this will provide insights about generating synthetic data at scale for pre-training. https://huggingface.co/blog/cosmopedia Here are some key takeaways: ๐ฏ Prompt curation is crucial: we want to cover many topics with few duplicates. ๐ You can leverage various resources for diversity: using different seed data, generation formats, and target audiences. โ๏ธ The importance of a good technical stack: for scalable generations with tools like llm-swarm and fast model training and evaluation. Have a good read!
upvoted
a
collection
about 2 months ago
HiDream-I1
liked
a Space
about 2 months ago
VisualCloze/VisualCloze
View all activity
Organizations
gorillaframeai
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
upvoted
a
collection
about 2 months ago
HiDream-I1
Collection
A collections of HiDream-I1 models.
โข
4 items
โข
Updated
Apr 8
โข
32