Drishti's Collections

STT

updated 23 days ago

  • Derur/vosk-stt-models

    Automatic Speech Recognition • Updated Apr 22 • 3

  • Running on Zero
    940

    Whisper Turbo

    Transcribe audio from files, microphone, or YouTube


  • Build error

    Deepspeech_live_speech_to_text


  • facebook/wav2vec2-base-960h

    Automatic Speech Recognition • 0.1B • Updated Nov 14, 2022 • 1.03M • 358

  • kyutai/stt-1b-en_fr

    Automatic Speech Recognition • Updated 23 days ago • 17 • 74

    Note: Kyutai released new speech-to-text models that come in 1B and 2B sizes.
