openai/whisper-large-v3

Can Deepspeed ZeRO3 be used for Sharding with Whisper-large-v3?

#204 opened 19 days ago by

PhoenixAxis

Fix example usage

#202 opened 23 days ago by

bezzam

RuntimeError: Trying to backward through the graph a second time

👍 1

3

#201 opened 29 days ago by

Sarwg

语音模型

#200 opened 30 days ago by

skyhe666

Incorrect feature_size in preprocessor_config.json (should be 80)

1

#199 opened about 1 month ago by

alexg1802

Upload 60fps Audio 2_45.wav

#198 opened about 1 month ago by

11fred11

Audio refinning

#197 opened about 1 month ago by

Baalbatos

BTS

2

#196 opened about 1 month ago by

Birima

Upload AUDIO.mp3.mp3

#195 opened about 1 month ago by

OJG15

Video format problem.

#194 opened about 1 month ago by

pantorn

Word Timestamps error: RuntimeError: The expanded size of the tensor (69) must match the existing size (72) at non-singleton dimension 1. Target sizes: [1, 69]. Tensor sizes: [72] whisper word.

👍 1

1

#193 opened about 2 months ago by

muaviyaijaz123

Muniyan

#192 opened about 2 months ago by

Orion14

two warnings

#190 opened 2 months ago by

youarecode

whisper large v3 finetuning using our own dataset

1

#189 opened 3 months ago by

rifasca

Upload New Recording 3.m4a

#188 opened 3 months ago by

Hellometo

Why does a tiny silence at the start of my audio change Whisper’s transcription?

#187 opened 4 months ago by

dylanewbie

Upload 10 files

#186 opened 5 months ago by

Usshhaa

Whisper openai vs hugging face question

#185 opened 5 months ago by

Sin2pi

what are the limits for Whisper here?

#184 opened 5 months ago by

vendeza

Rename README.md to README.mdwas könnte die Ursache für Kopf- und Nackenschmerzen Blutdruckschwankungen Papillenödem und übelkeit sein

1

#183 opened 5 months ago by

Oskar000

getting an error trying to extract word-level timestamps

#182 opened 5 months ago by

SphinxKing

Download error. Manually specifying the catalog with models.

#181 opened 5 months ago by

Incrediblecat

Can I use a CLI of it? Just found this project, no experience with AI, just want to convert audio speech to text

#180 opened 5 months ago by

vitaly-zdanevich

Upload AUD-20250217-WA0001.m4a

#179 opened 5 months ago by

Gerald02

Extract words with timestamps

#178 opened 6 months ago by

codingyash

Working example on Mac M-series

#177 opened 6 months ago by

pajikos

[badcase report] observe significant recognition errors on an example of libri_light dataset.

#176 opened 6 months ago by

lawlict

Upload audio1477337879.m4a

#175 opened 6 months ago by

Elena9292

Multi task finetuning (transcribe and translate)

#174 opened 7 months ago by

Phil-AB

SegmentAnything Ultra V2

1

#173 opened 7 months ago by

didi4725

How to freeze layers and fine tuning

2

#172 opened 7 months ago by

Sarwg

Challenges in Distinguishing Similar Phonemes (e.g., 'B' and 'V') in License Plate Speech Recognition Using Whisper-Large Model

#171 opened 8 months ago by

dylanewbie

Inference on fine-tuned whisper-large-v3 is not working, but is working on pre-trained model and whisper-medium

#169 opened 8 months ago by

ivabojic

🚩 Report: Not working

#168 opened 9 months ago by

ednsinf

Speech recognition broken down by speakers

👀 1

5

#167 opened 9 months ago by

tur0kmag

the chinese training data of the model is contaminated

2

#165 opened 9 months ago by

bookwoods123

Specify language for transcribing with HuggingFace API

7

#164 opened 9 months ago by

mikealexx

Isolate search for a single language

#163 opened 10 months ago by

edyrkaj

Whisper-GUI

2

#162 opened 10 months ago by

PrensCin

How to get SRT file as output

👀 👍 2

1

#161 opened 10 months ago by

dinesh-001

whisper large v3 turbo

2

#160 opened 10 months ago by

deepdml

Missing spaces between chunks in longform fine tune outputs & importance of tokenizer.json

1

#159 opened 10 months ago by

saj-bot

Update README.md

1

#158 opened 10 months ago by

Ironajijiul11

What is the accuracy of the model? Why can't I fine-tune when I set the accuracy to FP16?

1

#157 opened 11 months ago by

chengligen

Help for absolute AI beginners

2

#156 opened 11 months ago by

TobiasKuch

Single word transcription for a audio file with ~1.5m frames

#155 opened 11 months ago by

KevalRx

how to get n-best list generated by Whisper.

1

#153 opened 11 months ago by

louisguo

Hugging Face Model Deployment and Library Dependency Issues

#152 opened 11 months ago by

NeuraFusionAI

Issue - Internal Server Error (Serverless API)

#151 opened 11 months ago by

tushar310

how many GPU memory do I need to finetune largeV3

4

#150 opened 12 months ago by

lanejohn