Hugging Face Smol Models Research

AI & ML interests

Exploring smol models (for text, vision and video) and high quality web and synthetic datasets

Recent Activity

HuggingFaceTB's activity

fdaudens posted an update about 9 hours ago

Did we just drop personalized AI evaluation?! This tool auto-generates custom benchmarks on your docs to test which models are the best.

Most benchmarks test general capabilities, but what matters is how models handle your data and tasks. YourBench helps answer critical questions like:
- Do you really need a hundreds-of-billions-parameter model sledgehammer to crack a nut?
- Could a smaller, fine-tuned model work better?
- How well do different models understand your domain?

Some cool features:
📚 Generates custom benchmarks from your own documents (PDFs, Word, HTML)
🎯 Tests models on real tasks, not just general capabilities
🔄 Supports multiple models for different pipeline stages
🧠 Generate both single-hop and multi-hop questions
🔍 Evaluate top models and deploy leaderboards instantly
💰 Full cost analysis to optimize for your budget
🛠️ Fully configurable via a single YAML file

26 SOTA models tested for question generation. Interesting finding: Qwen2.5 32B leads in question diversity, while smaller Qwen models and Gemini 2.0 Flash offer great value for cost.

You can also run it locally on any models you want.

I'm impressed. Try it out: yourbench/demo

fdaudens posted an update 3 days ago

🔥 DeepSeek vibe coding with DeepSite is going viral with awesome projects!

From games to stunning visualizations, 7 wild examples:

📺 AI TV with custom channels and animations https://x.com/_akhaliq/status/1905747381951545647

🚀 Earth to Moon spacecraft journey visualization
Watch this incredible Three.js space simulation with zero external assets:
https://x.com/_akhaliq/status/1905836902533451999

💣 Minesweeper in 2.5 minutes! Built & deployed instantly on DeepSite. Zero setup needed:
https://x.com/cholf5/status/1906031928937218334

🎮 Asked for Game of Life, got a masterpiece. Simple prompt, complex features. See it in action: https://x.com/pbeyssac/status/1906304454824992844

💫 One-shot anime website with perfect UI. DeepSite turned a simple request into a fully-functional anime site: https://x.com/risphereeditor/status/1905961725028913264

📊 10-minute World Indicators Dashboard. Just described what I wanted and got a full interactive dashboard! https://x.com/i/status/1906345214089785634

✨ Ready to build without coding? Imagine it. Build it. Share it! enzostvs/deepsite

thomwolf posted an update 4 days ago

The new DeepSite space is really insane for vibe-coders
enzostvs/deepsite

With the wave of vibe-coding-optimized LLMs like the latest open-source DeepSeek model (version V3-0324), you can basically prompt out-of-the-box and create any app or game in one shot.

It feels so powerful to me, no more complex framework or under-the-hood prompt engineering to have a working text-to-app tool.

AI is eating the world and *open-source* AI is eating AI itself!

PS: and even more meta is that the DeepSite app and DeepSeek model are both fully open-source code => time to start recursively improving?

PPS: you still need some inference hosting unless you're running the 600B param model at home, so check the very nice list of HF Inference Providers for this model: deepseek-ai/DeepSeek-V3-0324

fdaudens posted an update 4 days ago

Want to vibecode with DeepSeek? Just spent 10 minutes with this space and created a full world indicators dashboard - literally just by describing what I wanted!

Anyone can now prototype and deploy projects instantly.

Try out the app: enzostvs/deepsite

My dashboard: fdaudens/world-indicators

fdaudens posted an update 7 days ago

Want to ramp up your AI skills and start breaking bigger stories? With the Journalists on Hugging Face community, we're launching our first learn-together course!

We'll build AI classifiers that process months of data in minutes. How?

- Work through an interactive version of an excellent course developed by Ben Welsh and Derek Willis
- Share findings and get help in our dedicated community channel
- Build working classifiers you can use in your reporting today

No coding background needed - if you can write a ChatGPT or Claude prompt, you can do this. Journalists are already using these techniques to break stories, from uncovering hidden real estate deals to tracking unusual campaign spending.

Join us - it might give you your next big story!

Thanks to Ben and Derek for letting me adapt their excellent course into this interactive version!

- Check out the course: JournalistsonHF/first-llm-classifier

- Join our Slack community to learn together: https://docs.google.com/forms/d/e/1FAIpQLSfyA7G6Y9q-5hDBSnGc3CFtg9H8fjqKCCuieptXuTqRudGNjQ/viewform

freddyaboulton posted an update 8 days ago

Ever wanted to share your AI creations with friends? ✨

Screenshots are fine, but imagine letting others play with your ACTUAL model!

Introducing Gradio deep links 🔗 - now you can share interactive AI apps, not just images.

Add a gr.DeepLinkButton to any app and get shareable URLs that let ANYONE experiment with your models.

merve posted an update 12 days ago

So many open releases at Hugging Face this past week 🤯 recapping all here ⤵️ merve/march-21-releases-67dbe10e185f199e656140ae

👀 Multimodal
> Mistral AI released a 24B vision LM, both base and instruction FT versions, sota 🔥 (OS)
> with IBM we released SmolDocling, a sota 256M document parser with Apache 2.0 license (OS)
> SpatialLM is a new vision LM that outputs 3D bounding boxes, comes with 0.5B (QwenVL based) and 1B (Llama based) variants
> SkyWork released SkyWork-R1V-38B, new vision reasoning model (OS)

💬 LLMs
> NVIDIA released new Nemotron models in 49B and 8B with their post-training dataset
> LG released EXAONE, new reasoning models in 2.4B, 7.8B and 32B
> Dataset: Glaive AI released a new reasoning dataset of 22M+ examples
> Dataset: NVIDIA released new helpfulness dataset HelpSteer3
> Dataset: OpenManusRL is a new agent dataset based on ReAct framework (OS)
> Open-R1 team released OlympicCoder, new competitive coder model in 7B and 32B
> Dataset: GeneralThought-430K is a new reasoning dataset (OS)

šŸ–¼ļø Image Generation/Computer Vision
> Roboflow released RF-DETR, new real-time sota object detector (OS) šŸ”„
> YOLOE is a new real-time zero-shot object detector with text and visual prompts šŸ„¹
> Stability AI released Stable Virtual Camera, a new novel view synthesis model
> Tencent released Hunyuan3D-2mini, new small and fast 3D asset generation model
> ByteDance released InfiniteYou, new realistic photo generation model
> StarVector is a new 8B model that generates svg from images
> FlexWorld is a new model that expands 3D views (OS)

🎤 Audio
> Sesame released CSM-1B, new speech generation model (OS)

🤖 Robotics
> NVIDIA released GR00T, new robotics model for generalized reasoning and skills, along with the dataset

*OS ones have Apache 2.0 or MIT license

fdaudens posted an update 13 days ago

🎥 Just tested Stability AI's Stable Virtual Camera - it turns a single photo into dynamic video with AI-powered camera movements! From static meeting room to cinematic sweeps. 🚀

Try it out: stabilityai/stable-virtual-camera

fdaudens posted an update 14 days ago

🔊 Meet Orpheus: A breakthrough open-source TTS model that matches human-level speech with empathy & emotion.
- Available in 4 sizes (150M-3B parameters)
- Delivers ultra-fast streaming
- Zero-shot voice cloning
- Apache 2.0 license

canopylabs/orpheus-tts-67d9ea3f6c05a941c06ad9d2

fdaudens posted an update 16 days ago

Want to build useful newsroom tools with AI? We’re launching a Hugging Face x Journalism Slack channel where journalists turn AI concepts into real newsroom solutions.

Inside the community:
✅ Build open-source AI tools for journalism
✅ Get direct help from the community
✅ Stay updated on new models and datasets
✅ Learn from other journalists’ experiments and builds

The goal? Go from “I read about AI” to “I built an AI tool that supercharged my newsroom.” No more learning in isolation.

Join us! https://join.slack.com/t/journalistson-tnd8294/shared_invite/zt-30vsmhk4w-dZpeMOoxdhCvfNsqtspPUQ (Please make sure to use a clear identity: no teddybear85, for example 😉)

(If you know people who might be interested, tag them below! The more minds we bring in, the better the tools we build.)

fdaudens posted an update 20 days ago

🤯 Gemma 3's image analysis blew me away!

Tested 2 ways to extract airplane registration numbers from photos with the 12B model:

1️⃣ Gradio app w/API link (underrated feature IMO) + ZeroGPU infra on Hugging Face in Google Colab. Fast & free.

2️⃣ LMStudio + local processing (100% private). Running this powerhouse on a MacBook w/16GB RAM is wild! 🚀

Colab: https://colab.research.google.com/drive/1YmmaP0IDEu98CLDppAAK9kbQZ7lFnLZ1?usp=sharing

fdaudens posted an update 21 days ago

Ever wanted 45 min with one of AI’s most fascinating minds? Was with @thomwolf at HumanX Vegas. Sharing my notes from his Q&A with the press, which completely changed how I think about AI’s future:

1️⃣ The next wave of successful AI companies won’t be defined by who has the best model but by who builds the most useful real-world solutions. "We all have engines in our cars, but that’s rarely the only reason we buy one. We expect it to work well, and that’s enough. LLMs will be the same."

2️⃣ Big players are pivoting: "Closed-source companies, OpenAI being the first, have largely shifted from LLM announcements to product announcements."

3️⃣ Open source is changing everything: "DeepSeek was open source AI’s ChatGPT moment. Basically, everyone outside the bubble realized you can get a model for free, and it’s just as good as the paid ones."

4️⃣ Product innovation is being democratized: Take Manus, for example. They built a product on top of Anthropic’s models that’s "actually better than Anthropic’s own product for now, in terms of agents." This proves that anyone can build great products with existing models.

We’re entering a "multi-LLM world," where models are becoming commoditized, and all the tools to build are readily available. Just look at the flurry of daily new releases on Hugging Face.

Thom's comparison to the internet era is spot-on: "In the beginning you made a lot of money by making websites... but nowadays the huge internet companies are not the companies that built websites. Like Airbnb, Uber, Facebook, they just use the internet as a medium to make something for real life use cases."

Love to hear your thoughts on this shift!

thomwolf posted an update 22 days ago

We've kept pushing our Open-R1 project, an open initiative to replicate and extend the techniques behind DeepSeek-R1.

And even we were mind-blown by the results we got with this latest model we're releasing: ⚡️OlympicCoder (open-r1/OlympicCoder-7B and open-r1/OlympicCoder-32B)

It's beating Claude 3.7 on (competitive) programming, a domain Anthropic has historically been really strong at, and it's getting close to o1-mini/R1 on olympiad-level coding with just 7B parameters!

And the best part is that we're open-sourcing everything about its training dataset, the new IOI benchmark, and more in our Open-R1 progress report #3: https://huggingface.co/blog/open-r1/update-3

Datasets we are releasing:
- open-r1/codeforces
- open-r1/codeforces-cots
- open-r1/ioi
- open-r1/ioi-test-cases
- open-r1/ioi-sample-solutions
- open-r1/ioi-cots
- open-r1/ioi-2024-model-solutions