openai/whisper-large-v3

Video format problem.

#194 opened 1 day ago by

pantorn

Word Timestamps error: RuntimeError: The expanded size of the tensor (69) must match the existing size (72) at non-singleton dimension 1. Target sizes: [1, 69]. Tensor sizes: [72] whisper word.

1

#193 opened 6 days ago by

muaviyaijaz123

Muniyan

#192 opened 13 days ago by

Orion14

two warnings

#190 opened 28 days ago by

youarecode

whisper large v3 finetuning using our own dataset

1

#189 opened about 2 months ago by

rifasca

Upload New Recording 3.m4a

#188 opened about 2 months ago by

Hellometo

Why does a tiny silence at the start of my audio change Whisper’s transcription?

#187 opened 2 months ago by

dylanewbie

Upload 10 files

#186 opened 3 months ago by

Usshhaa

Whisper openai vs hugging face question

#185 opened 3 months ago by

Sin2pi

what are the limits for Whisper here?

#184 opened 3 months ago by

vendeza

Rename README.md to README.mdwas könnte die Ursache für Kopf- und Nackenschmerzen Blutdruckschwankungen Papillenödem und übelkeit sein

1

#183 opened 3 months ago by

Oskar000

getting an error trying to extract word-level timestamps

#182 opened 3 months ago by

SphinxKing

Download error. Manually specifying the catalog with models.

#181 opened 4 months ago by

Incrediblecat

Can I use a CLI of it? Just found this project, no experience with AI, just want to convert audio speech to text

#180 opened 4 months ago by

vitaly-zdanevich

Upload AUD-20250217-WA0001.m4a

#179 opened 4 months ago by

Gerald02

Extract words with timestamps

#178 opened 4 months ago by

codingyash

Working example on Mac M-series

#177 opened 4 months ago by

pajikos

[badcase report] observe significant recognition errors on an example of libri_light dataset.

#176 opened 4 months ago by

lawlict

Upload audio1477337879.m4a

#175 opened 4 months ago by

Elena9292

Multi task finetuning (transcribe and translate)

#174 opened 5 months ago by

Phil-AB

SegmentAnything Ultra V2

1

#173 opened 6 months ago by

didi4725

How to freeze layers and fine tuning

2

#172 opened 6 months ago by

Sarwg

Challenges in Distinguishing Similar Phonemes (e.g., 'B' and 'V') in License Plate Speech Recognition Using Whisper-Large Model

#171 opened 6 months ago by

dylanewbie

Inference on fine-tuned whisper-large-v3 is not working, but is working on pre-trained model and whisper-medium

#169 opened 7 months ago by

ivabojic

🚩 Report: Not working

#168 opened 7 months ago by

ednsinf

Speech recognition broken down by speakers

👀 1

5

#167 opened 8 months ago by

tur0kmag

the chinese training data of the model is contaminated

2

#165 opened 8 months ago by

bookwoods123

Specify language for transcribing with HuggingFace API

7

#164 opened 8 months ago by

mikealexx

Isolate search for a single language

#163 opened 8 months ago by

edyrkaj

Whisper-GUI

2

#162 opened 8 months ago by

PrensCin

How to get SRT file as output

👍 👀 2

1

#161 opened 8 months ago by

dinesh-001

whisper large v3 turbo

2

#160 opened 9 months ago by

deepdml

Missing spaces between chunks in longform fine tune outputs & importance of tokenizer.json

1

#159 opened 9 months ago by

saj-bot

Update README.md

1

#158 opened 9 months ago by

Ironajijiul11

What is the accuracy of the model? Why can't I fine-tune when I set the accuracy to FP16?

1

#157 opened 9 months ago by

chengligen

Help for absolute AI beginners

2

#156 opened 9 months ago by

TobiasKuch

Single word transcription for a audio file with ~1.5m frames

#155 opened 9 months ago by

KevalRx

how to get n-best list generated by Whisper.

1

#153 opened 10 months ago by

louisguo

Hugging Face Model Deployment and Library Dependency Issues

#152 opened 10 months ago by

NeuraFusionAI

Issue - Internal Server Error (Serverless API)

#151 opened 10 months ago by

tushar310

how many GPU memory do I need to finetune largeV3

4

#150 opened 10 months ago by

lanejohn

Better output in INT8

1

#149 opened 10 months ago by

aney

how to translate model ( whisper-small ) to pt file (small.pt)?

4

#146 opened 10 months ago by

lihenan1996

how to get the same output result format from the pipeline as we get from the open ai whisper?

➕ 2

#145 opened 10 months ago by

aheed911

🚩 Report

#143 opened 11 months ago by

dinlun777

الأصدقاء

1

#142 opened 11 months ago by

monir2006

Add Urdu (Pakistan) language speech to text detection

1

#141 opened 11 months ago by

AamirFarooq

whisper segments

#138 opened 11 months ago by

world-of-ai

Git repository or how to instructions on downloading and using the model

#137 opened 11 months ago by

J-PROGRAMMER

Rename README.md to wangdaoqi

#136 opened 12 months ago by

dqoqi