SpeechSegmenter / README.md
Rafii's picture
Update README.md
dff3004 verified

A newer version of the Gradio SDK is available: 5.40.0

Upgrade
metadata
title: Speech Segmenter (STT)
emoji: 🏃
colorFrom: gray
colorTo: blue
sdk: gradio
sdk_version: 5.39.0
app_file: app.py
pinned: false
short_description: Advanced audio transcription with alignment & diarization

This Space provides an advanced Speech-to-Text (STT) pipeline enhanced with alignment and speaker diarization:

  • STT (Speech-to-Text): Converts spoken audio into written text (transcription).
  • Alignment: Aligns words with their timestamps in the audio (word-level timing).
  • Speaker Diarization: Detects and labels who spoke when — the “who spoke what” part.
  • Post-processing: Combines all that info to produce a richer, structured transcript.

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference