nyuuzyou's picture

nyuuzyou PRO

nyuuzyou

AI & ML interests

None yet

Recent Activity

View all activity

Organizations

Social Post Explorers's profile picture AI Starter Pack's profile picture

nyuuzyou's activity

reacted to fdaudens's post with ❤️ 1 day ago
view post
Post
4175
Yes, DeepSeek R1's release is impressive. But the real story is what happened in just 7 days after:

- Original release: 8 models, 540K downloads. Just the beginning...

- The community turned those open-weight models into +550 NEW models on Hugging Face. Total downloads? 2.5M—nearly 5X the originals.

The reason? DeepSeek models are open-weight, letting anyone build on top of them. Interesting to note that the community focused on quantized versions for better efficiency & accessibility. They want models that use less memory, run faster, and are more energy-efficient.

When you empower builders, innovation explodes. For everyone. 🚀

The most popular community model? @bartowski 's DeepSeek-R1-Distill-Qwen-32B-GGUF version — 1M downloads alone.
  • 2 replies
·
reacted to clem's post with 🤗 1 day ago
view post
Post
3568
AI is not a zero-sum game. Open-source AI is the tide that lifts all boats!
reacted to hexgrad's post with ❤️ 3 days ago
view post
Post
2853
IMHO, being able & willing to defeat CAPTCHA, hCaptcha, or any other reasoning puzzle is a must-have for any Web-Browsing / Computer-Using Agent (WB/CUA).

I realize it subverts the purpose of CAPTCHA, but I do not think you can claim to be building AGI/agents without smoothly passing humanity checks. It would be like getting in a self-driving car that requires human intervention over speed bumps. Claiming AGI or even "somewhat powerful AI" seems hollow if you are halted by a mere CAPTCHA.

I imagine OpenAI's Operator is *able* but *not willing* to defeat CAPTCHA. Like their non-profit status, I expect that policy to evolve over time—and if not, rival agent-builders will attack that opening to offer a better product.
  • 2 replies
·
posted an update 5 days ago
view post
Post
372
🤗Emojis Dataset - nyuuzyou/emojis

A collection of metadata for 3,264,372 AI-generated emoji images featuring:
- URLs to AI-generated emoji artwork images
- Links to both full-resolution transparent PNGs and compressed WebP formats
- Unique identifiers and slugs for each emoji entry
- Original prompts
posted an update 8 days ago
view post
Post
1460
🤖 Begemot.ai Dataset - nyuuzyou/begemot

A collection of 2,728,999 AI-generated educational projects featuring:
- Comprehensive Russian language educational content
- Complete project metadata including titles, descriptions and chapters
- Educational project descriptions and content
- Direct URLs to project pages
- Project titles and detailed descriptions

All content is available under CC0 license, allowing unrestricted use including commercial applications.
New activity in nyuuzyou/artfol 10 days ago
posted an update 11 days ago
view post
Post
1673
🎨 Artfol Dataset - nyuuzyou/artfol

A collection of 1,892,816 artwork posts featuring:
- High-quality art pieces with various styles and techniques
- Complete metadata including artist IDs, titles, and moderation flags
- Content from Artfol social media platform

The dataset contains:
- Public domain artwork posts
- Artist attribution and identifiers
- Direct image URLs and web page links
- Content safety flags (NSFW, gore)
- Post titles and descriptions

All content is available under CC0 license, allowing unrestricted use including commercial applications.
New activity in nyuuzyou/rule34world 15 days ago
New activity in nyuuzyou/furbooru 18 days ago
posted an update 19 days ago
view post
Post
1501
🗂️ I don't think the collections feature of Hugging Face is widely used, even though it's an excellent way to organize and discover interesting resources. To do my bit to change that, I've created two carefully curated collections that combine both my original work and other valuable datasets:

Educational Datasets
- Mostly English-Russian, but other languages are also included
- Extended by my new Begemot.ai dataset (2.7M+ Russian education records) nyuuzyou/begemot

Link: nyuuzyou/educational-datasets-677c268978ac1cec96cc3605

Anime & Art

- Extensive art-focused collection, including my new datasets:
- Buzzly.art (2K artworks) nyuuzyou/buzzlyart
- Paintberri (60K+ pieces) nyuuzyou/paintberri
- Itaku.ee (924K+ items) nyuuzyou/itaku
- Extended with other amazing datasets from the community

Link: nyuuzyou/anime-and-art-677ae996682a389fccd892c3

Collections should become a more common feature - hopefully this will encourage others to create and share their own curated collections. By organizing related datasets into these themed collections, I hope to make it easier for researchers and developers to discover and use these valuable resources.
  • 1 reply
·