gg-hf

Enterprise

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

jrd971000 authored a paper 6 days ago

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

jrd971000 authored a paper 8 days ago

Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

jrd971000 authored a paper 8 days ago

Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning

View all activity

Xenova

posted an update 5 days ago

Post

2282

Okay this is insane... WebGPU-accelerated semantic video tracking, powered by DINOv3 and Transformers.js! 🤯
Demo (+ source code): webml-community/DINOv3-video-tracking

This will revolutionize AI-powered video editors... which can now run 100% locally in your browser, no server inference required (costs $0)! 😍

How does it work? 🤔
1️⃣ Generate and cache image features for each frame
2️⃣ Create a list of embeddings for selected patch(es)
3️⃣ Compute cosine similarity between each patch and the selected patch(es)
4️⃣ Highlight those whose score is above some threshold

... et voilà! 🥳

You can also make selections across frames to improve temporal consistency! This is super useful if the object changes its appearance slightly throughout the video.

Excited to see what the community builds with it!

jrd971000

authored a paper 6 days ago

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Paper • 2508.14444 • Published 7 days ago • 31

jrd971000

authored 2 papers 8 days ago

Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

Paper • 2504.03624 • Published Apr 4 • 13

Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning

Paper • 2504.11409 • Published Apr 15 • 10

Xenova

posted an update 21 days ago

Post

3937

The next generation of AI-powered websites is going to be WILD! 🤯

In-browser tool calling & MCP is finally here, allowing LLMs to interact with websites programmatically.

To show what's possible, I built a demo using Liquid AI's new LFM2 model, powered by 🤗 Transformers.js: LiquidAI/LFM2-WebGPU

As always, the demo is open source (which you can find under the "Files" tab), so I'm excited to see how the community builds upon this! 🚀

1 reply

Rayne

updated a model 29 days ago

gg-hf/gemma-2b-mini-clrs

3B • Updated 29 days ago • 19

Rayne

published a model 29 days ago

gg-hf/gemma-2b-mini-clrs

3B • Updated 29 days ago • 19

Xenova

posted an update about 1 month ago

Post

2997

Introducing Voxtral WebGPU: State-of-the-art audio transcription directly in your browser! 🤯
🗣️ Transcribe videos, meeting notes, songs and more
🔐 Runs on-device, meaning no data is sent to a server
🌎 Multilingual (8 languages)
🤗 Completely free (forever) & open source

That's right, we're running Mistral's new Voxtral-Mini-3B model 100% locally in-browser on WebGPU, powered by Transformers.js and ONNX Runtime Web! 🔥

Try it out yourself! 👇
webml-community/Voxtral-WebGPU

ariG23498

posted an update about 1 month ago

Post

695

I have always advocated for writing techinical stories without using LLMs.

The following one page editorial really drives the point home.
https://www.nature.com/articles/s44222-025-00323-4

bzhanggo

updated 11 models about 2 months ago

AI & ML interests

Recent Activity

Team members 121

gg-hf's activity