Daniel Bourke PRO

mrdbourke

AI & ML interests

Computer vision. Small on-device models. VLMs. High-quality tutorials.

Recent Activity

updated a Space 2 days ago
mrdbourke/LLMDet-demo
liked a Space 2 days ago
mrdbourke/LLMDet-demo
published a Space 2 days ago
mrdbourke/LLMDet-demo

Organizations

None yet

mrdbourke's activity

reacted to merve's post with 🚀 2 days ago
Yesterday was the day of vision language action models (VLAs)!

> SmolVLA: open-source small VLA for robotics by Hugging Face LeRobot team 🤖
Blog: https://huggingface.co/blog/smolvla
Model: lerobot/smolvla_base

> Holo-1: 3B & 7B web/computer use agentic VLAs by H Company 💻
Model family: Hcompany/holo1-683dd1eece7eb077b96d0cbd
Demo: https://huggingface.co/spaces/multimodalart/Holo1
Blog: https://huggingface.co/blog/Hcompany/holo1
super exciting times!!
reacted to Xenova's post with 🔥 2 days ago
NEW: Real-time conversational AI models can now run 100% locally in your browser! 🤯

🔐 Privacy by design (no data leaves your device)
💰 Completely free... forever
📦 Zero installation required, just visit a website
⚡️ Blazingly-fast WebGPU-accelerated inference

Try it out: webml-community/conversational-webgpu

For those interested, here's how it works:
- Silero VAD for voice activity detection
- Whisper for speech recognition
- SmolLM2-1.7B for text generation
- Kokoro for text to speech

Powered by Transformers.js and ONNX Runtime Web! 🤗 I hope you like it!
reacted to AdinaY's post with 🔥 2 days ago
New activity in nvidia/C-RADIOv3-B 3 days ago
upvoted an article 4 days ago

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

By danaaubakirova and 8 others