Anurag's picture

Building on HF

Anurag

edwixx

·

https://anuragkanade.com/

AI & ML interests

Machine Learning, and Speech

Recent Activity

updated a model about 6 hours ago

edwixx/miraTTS-hindi

published a model about 12 hours ago

edwixx/miraTTS-hindi

updated a dataset about 13 hours ago

edwixx/hindi-female-tts

View all activity

Organizations

upvoted an article about 1 month ago

Article

Continuous batching from first principles

+1

Nov 25, 2025

•

296

upvoted a paper about 2 months ago

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published Nov 9, 2025 • 132

upvoted 2 articles 2 months ago

Article

Llasa Goes RL: Training LLaSA with GRPO for Improved Prosody and Expressiveness

Nov 5, 2025

•

10

Article

G2P Shrinks Speech Models

Feb 5, 2025

•

82

upvoted a changelog 2 months ago

Changelog

Set Default Sorting in the Community Tab

Oct 28, 2025

• 68

upvoted 2 articles 2 months ago

Article

Large-scale Near-deduplication Behind BigCode

May 16, 2023

•

37

Article

Building the Open Agent Ecosystem Together: Introducing OpenEnv

+8

Oct 23, 2025

•

139

upvoted a collection 3 months ago

TTS

Collection of some of the TTS models i found cool • 6 items • Updated Oct 10, 2025 • 1

upvoted a paper 4 months ago

EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs

Paper • 2509.09174 • Published Sep 11, 2025 • 61

upvoted an article 5 months ago

Article

Say hello to `hf`: a faster, friendlier Hugging Face CLI ✨

+1

Jul 25, 2025

•

83

upvoted an article 7 months ago

Article

KV Cache from scratch in nanoVLM

+3

Jun 4, 2025

•

108

upvoted 2 collections 8 months ago

MedGemma Release

Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. • 7 items • Updated Jul 11, 2025 • 369

Qwen3

84 items • Updated 6 days ago • 1.54k

upvoted a collection 9 months ago

Orpheus Multilingual Research Release

Beta Release of multilingual models. • 12 items • Updated Apr 10, 2025 • 109

upvoted 2 papers 10 months ago

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Paper • 2503.08638 • Published Mar 11, 2025 • 71

SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference

Paper • 2502.18137 • Published Feb 25, 2025 • 59

upvoted an article 11 months ago

Article

Build awesome datasets for video generation

Feb 12, 2025

•

34

upvoted a paper 12 months ago

Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Paper • 2412.15322 • Published Dec 19, 2024 • 20

upvoted 2 collections about 1 year ago

🎨 Image models

10 items • Updated 1 day ago • 2

BhasaAnuvaad

A Speech Translation Dataset for 13 Indian Languages • 11 items • Updated Jan 16, 2025 • 25