🔄 In a Training Loop

Omar Sanseviero PRO

osanseviero

google

·

https://osanseviero.github.io/hackerllama/

AI & ML interests

Llamas, model merging, massive ASR for data collection, 3D ML, on-device ML, quantization, model judging, ML in browser, healthcare applications, education, intersection of art and ML.🦙

Recent Activity

liked a Space 3 days ago

google/gemma4_vision_token_budget

updated a collection 9 days ago

authored a paper 21 days ago

Gemma 4 Technical Report

View all activity

Organizations

upvoted a paper 22 days ago

Gemma 4 Technical Report

Paper • 2607.02770 • Published 29 days ago • 75

upvoted an article 29 days ago

Article

Hugging Face and Cerebras bring Gemma 4 to real-time voice AI

+2

A-Mahla, andito, lvwerra, vyassaurabh

•

30 days ago

• 89

upvoted a collection about 2 months ago

DiffusionGemma

1 item • Updated 9 days ago • 60

upvoted a collection 3 months ago

Gemma 4

16 items • Updated 9 days ago • 1.06k

upvoted an article 5 months ago

Article

Konkani LLM: Bringing a Multi-Script Low-Resource Language to the AI Era

Reubencf

•

Mar 7

• 8

upvoted a collection 7 months ago

T5Gemma 2

3 items • Updated 9 days ago • 79

upvoted a paper 10 months ago

EmbeddingGemma: Powerful and Lightweight Text Representations

Paper • 2509.20354 • Published Sep 24, 2025 • 51

upvoted 2 articles 11 months ago

Article

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

+5

ariG23498, sergiopaniego, reach-vb, pcuenq, ArthurZ, SaylorTwift, cyrilvallez

•

Sep 11, 2025

• 189

Article

Welcome EmbeddingGemma, Google's new efficient embedding model

+4

tomaarsen, Xenova, alvarobartt, ariG23498, pcuenq, sergiopaniego

•

Sep 4, 2025

• 275

upvoted a collection 11 months ago

EmbeddingGemma

3 items • Updated 9 days ago • 123

upvoted a collection about 1 year ago

T5Gemma

32 items • Updated 9 days ago • 85

upvoted an article about 1 year ago

Article

Gemma 3n fully available in the open-source ecosystem!

+6

ariG23498, pcuenq, sergiopaniego, reach-vb, FL33TW00D-HF, Xenova, Steveeeeeeen, kashif

•

Jun 26, 2025

• 122

upvoted a paper about 1 year ago

VideoPrism: A Foundational Visual Encoder for Video Understanding

Paper • 2402.13217 • Published Feb 20, 2024 • 41

upvoted a changelog about 1 year ago

Hugging Face Changelog

New Inference Providers Dashboard

Jun 5, 2025

• 73

upvoted a collection about 1 year ago

GRMR V3 Models

An improved set of models for grammar correction. (Chat template should work, no "responding as an LLM" anymore, that kind of stuff). • 6 items • Updated Jun 4, 2025 • 10

upvoted a paper about 1 year ago

One RL to See Them All: Visual Triple Unified Reinforcement Learning

Paper • 2505.18129 • Published May 23, 2025 • 63

upvoted an article about 1 year ago

Article

The Transformers Library: standardizing model definitions

+2

lysandre, ArthurZ, pcuenq, julien-c

•

May 15, 2025

• 123

upvoted 2 collections about 1 year ago

MedGemma Release

Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. • 9 items • Updated 9 days ago • 516

Gemma 3n Preview

4 items • Updated 9 days ago • 210

upvoted an article over 1 year ago

Article

17 Reasons Why Gradio Isn't Just Another UI Library

ysharma, abidlabs

•

Apr 16, 2025

• 44