langtech-innovation/WhisperLiveKitDiarization

Dominik Macháček committed on Jan 3, 2024

Commit bfbe83d

Samples should be an integer, not seconds

- Merge pull request #49 from skripnik/patch-1
- tested performance -- ESIC dev2, 27 docs, on En, De, Cs ASR, Nvidia A40, min chunk 1s, VAD => it has lower WER and latency with "segment" buffer trimming with various thresholds

Files changed (1)

whisper_online.py +1 -1

@@ -355,7 +355,7 @@ class OnlineASRProcessor:
         """
         self.transcript_buffer.pop_commited(time)
         cut_seconds = time - self.buffer_time_offset
-        self.audio_buffer = self.audio_buffer[int(cut_seconds)*self.SAMPLING_RATE:]
+        self.audio_buffer = self.audio_buffer[int(cut_seconds*self.SAMPLING_RATE):]
         self.buffer_time_offset = time
         self.last_chunked_at = time
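
A minimal sketch of why the position of `int()` matters here. It is not code from `whisper_online.py`: the helper names `trim_old` and `trim_new`, the 16 kHz `SAMPLING_RATE`, and the 3-second buffer are assumptions chosen for illustration. Truncating `cut_seconds` before multiplying discards the fractional second, so fewer samples are dropped than the time offset is advanced by.

```python
SAMPLING_RATE = 16000  # assumed sample rate, for illustration only


def trim_old(audio_buffer, cut_seconds):
    # Old behaviour: truncate to whole seconds first, then convert to samples.
    return audio_buffer[int(cut_seconds) * SAMPLING_RATE:]


def trim_new(audio_buffer, cut_seconds):
    # New behaviour: convert to samples first, then truncate to an integer index.
    return audio_buffer[int(cut_seconds * SAMPLING_RATE):]


# Hypothetical 3-second buffer of silence, cut at 1.5 s.
audio = [0.0] * (3 * SAMPLING_RATE)
cut = 1.5

old_len = len(trim_old(audio, cut))  # drops only 1.0 s worth of samples
new_len = len(trim_new(audio, cut))  # drops the full 1.5 s worth of samples

# Samples left in the buffer that the advanced time offset no longer accounts for.
stale_samples = old_len - new_len
print(old_len, new_len, stale_samples)
```

With the old expression, `buffer_time_offset` moves forward by the full `cut_seconds` while the buffer keeps up to one second of audio from before that offset, so buffer positions and timestamps can drift apart on every non-integer cut.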