File size: 792 Bytes
a3fb860
cb012cd
a3fb860
 
 
 
 
 
 
dff3004
a3fb860
 
db8d886
cb012cd
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
---
title: Speech Segmenter (STT)
emoji: 🏃
colorFrom: gray
colorTo: blue
sdk: gradio
sdk_version: 5.39.0
app_file: app.py
pinned: false
short_description: Advanced audio transcription with alignment & diarization 
---

This Space provides an advanced **Speech-to-Text (STT)** pipeline enhanced with alignment and speaker diarization:

- **STT (Speech-to-Text):** Converts spoken audio into written text (transcription).
- **Alignment:** Aligns words with their timestamps in the audio (word-level timing).
- **Speaker Diarization:** Detects and labels who spoke when — the “who spoke what” part.
- **Post-processing:** Combines all that info to produce a richer, structured transcript.

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference