12 112 546

Daniel Bourke PRO

mrdbourke

https://www.mrdbourke.com

AI & ML interests

Computer vision. Small on-device models. VLMs. High-quality tutorials.

Recent Activity

updated a Space 2 days ago

mrdbourke/LLMDet-demo

liked a Space 2 days ago

mrdbourke/LLMDet-demo

published a Space 2 days ago

mrdbourke/LLMDet-demo

View all activity

Organizations

None yet

mrdbourke's activity

updated a Space 2 days ago

LLMDet Demo

🏃

Open-vocabulary object detection with LLMDet.

liked a Space 2 days ago

LLMDet Demo

🏃

Open-vocabulary object detection with LLMDet.

published a Space 2 days ago

LLMDet Demo

🏃

Open-vocabulary object detection with LLMDet.

reacted to merve's post with 🚀 2 days ago

Post

1335

Yesterday was the day of vision language action models (VLAs)!

> SmolVLA: open-source small VLA for robotics by Hugging Face LeRobot team 🤖
Blog: https://huggingface.co/blog/smolvla
Model: lerobot/smolvla_base

> Holo-1: 3B & 7B web/computer use agentic VLAs by H Company 💻
Model family: Hcompany/holo1-683dd1eece7eb077b96d0cbd
Demo: https://huggingface.co/spaces/multimodalart/Holo1
Blog: https://huggingface.co/blog/Hcompany/holo1
super exciting times!!

reacted to Xenova's post with 🔥 2 days ago

Post

2219

NEW: Real-time conversational AI models can now run 100% locally in your browser! 🤯

🔐 Privacy by design (no data leaves your device)
💰 Completely free... forever
📦 Zero installation required, just visit a website
⚡️ Blazingly-fast WebGPU-accelerated inference

Try it out: webml-community/conversational-webgpu

For those interested, here's how it works:
- Silero VAD for voice activity detection
- Whisper for speech recognition
- SmolLM2-1.7B for text generation
- Kokoro for text to speech

Powered by Transformers.js and ONNX Runtime Web! 🤗 I hope you like it!

2 replies

reacted to AdinaY's post with 🔥 2 days ago

Post

1717

New models from Qwen 🔥

Qwen3-Embedding and Qwen3-Reranker Series just released on the hub by
Alibaba Qwen team.

✨ 0.6B/ 4B/ 8B with Apache2.0
✨ Supports 119 languages 🤯
✨ Top-tier performance: Leading the MTEB multilingual leaderboard！

Reranker:
Qwen/qwen3-reranker-6841b22d0192d7ade9cdefea
Embedding:
Qwen/qwen3-embedding-6841b2055b99c44d9a4c371f