Hugging Face

aikongfu's Collections

speech recognition

updated Nov 21, 2024

  • Running on L40S
    2.37k

    Whisper

    📉

    Transcribe audio from microphone, files, or YouTube


  • Running on Zero
    363

    Video Transcription Smart Summary

    ⚡

    Generate summaries from YouTube videos or uploaded videos


  • Running on Zero
    706

    Whisper Large V3

    🤫

    Transcribe audio and YouTube videos to text


  • Running on Zero
    800

    Video Dubbing (SoniTranslate)

    🌍

    Video Dubbing with Open Source Projects


  • Running
    269

    Faster Whisper Webui

    🚀

    Transcribe audio to text with speaker diarization
