---
library_name: peft
base_model: openai/whisper-large-v3
---

# Model Card for Model ID

A PEFT adapter for `openai/whisper-large-v3`, fine-tuned for Swiss German speech recognition.

## Model Details

### Model Description

- **Developed by:** Flurin17
- **Language(s) (NLP):** Swiss German
- **License:** Not specified by the author; see the license of the base model, `openai/whisper-large-v3`
- **Finetuned from model:** openai/whisper-large-v3

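## Uses

The example below assumes `model`, `tokenizer`, and `feature_extractor` are already loaded: the base Whisper model with this adapter applied, plus the matching Whisper tokenizer and feature extractor.

```python
import torch
from transformers import AutomaticSpeechRecognitionPipeline

pipe = AutomaticSpeechRecognitionPipeline(model=model, tokenizer=tokenizer, feature_extractor=feature_extractor)

# Mixed-precision inference; requires a CUDA device.
with torch.autocast("cuda"):
    # Replace the path with your own audio file.
    result = pipe(r"L:\Coding\random\audio.mp3", generate_kwargs={"language": "german"})

print(result["text"])
```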
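
One way to obtain the `model`, `tokenizer`, and `feature_extractor` objects used above is to load the base checkpoint and apply the adapter with PEFT. The following is a minimal sketch under assumptions, not the author's confirmed setup: the helper name `load_asr_pipeline` and the `adapter_id` parameter are hypothetical, and this card does not state the adapter's repository ID, so it has to be supplied by the reader. Device placement and quantization are left out.

```python
def load_asr_pipeline(adapter_id: str, base_id: str = "openai/whisper-large-v3"):
    """Build an ASR pipeline from the base Whisper model plus a PEFT adapter.

    `adapter_id` is a Hub repo ID or local path to the adapter weights
    (an assumption: this card does not name it).
    """
    # Imported lazily so that defining this helper does not need the heavy dependencies.
    from peft import PeftModel
    from transformers import (
        AutomaticSpeechRecognitionPipeline,
        WhisperForConditionalGeneration,
        WhisperProcessor,
    )

    # The processor bundles the tokenizer and the feature extractor of the base model.
    processor = WhisperProcessor.from_pretrained(base_id)
    base_model = WhisperForConditionalGeneration.from_pretrained(base_id)

    # Wrap the base model with the adapter weights.
    model = PeftModel.from_pretrained(base_model, adapter_id)
    model.eval()

    return AutomaticSpeechRecognitionPipeline(
        model=model,
        tokenizer=processor.tokenizer,
        feature_extractor=processor.feature_extractor,
    )
```

Calling `load_asr_pipeline("<adapter-repo-or-path>")` with the real adapter location in place of the placeholder would return a pipeline that can be used like `pipe` above.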
### Framework versions

- PEFT 0.7.1
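
To compare a local environment with the version listed here, a small check using only the standard library might look like the sketch below. The helper name `peft_version_status` is hypothetical, and a version mismatch does not necessarily mean the adapter will fail to load.

```python
from importlib.metadata import PackageNotFoundError, version

EXPECTED_PEFT = "0.7.1"  # version listed in this card


def peft_version_status(expected: str = EXPECTED_PEFT) -> str:
    """Return a short message comparing the installed PEFT version with `expected`."""
    try:
        installed = version("peft")
    except PackageNotFoundError:
        return "peft is not installed"
    if installed == expected:
        return f"peft {installed} matches this card"
    return f"peft {installed} differs from {expected} listed in this card"


print(peft_version_status())
```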