islomov
/

navaistt_v2_medium

Automatic Speech Recognition

audio-transcription

speech-recognition

Model card Files Files and versions Community

Ready for deploy STT Service WebUI - based on v2 model

#1

pinned

by nek1987 - opened Jun 24

nek1987

Jun 24

•

A Docker-based STT (Automatic Speech Recognition) system using faster-whisper with a Gradio web UI.
Features

faster‑whisper with CTranslate2 for ~2× real-time transcription on GPU
Lazy loading: model loads on first request (20–30 s), then stays in memory
Token‑based auth: secure /transcribe with X-API-KEY
UI Basic Auth: protect Gradio interface with username/password
Multi‑format support: WAV, MP3, OGG/Opus, M4A, FLAC, AMR, etc.
Health check: /healthz returns OK immediately

https://github.com/nek1987/stt-service-webui/

islomov pinned discussion Jun 24

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment