Spaces:

hf-audio
/

open_asr_leaderboard

Running on CPU Upgrade

App Files Files Community

Please add State of the Art proprietary models to OpenASR leaderboard

#37

by Buttermilk03 - opened 10 days ago

Discussion

Buttermilk03

10 days ago

•

edited 10 days ago

Add Gemini 2.5 Pro
According to the release paper Gemini 2.5 pro could be SOTA
(https://huggingface.co/google)

Add Soniox
Please add Soniox https://soniox.com/

Documentation for Async Transcription
https://soniox.com/docs/speech-to-text/get-started/transcribe-file

Documentation for realtime transcription
https://soniox.com/docs/speech-to-text/get-started/transcribe-realtime
https://huggingface.co/soniox

Add Amazon Nova Sonic
Please add Amazon Nova Sonic https://aws.amazon.com/de/ai/generative-ai/nova/speech/

API Documentation
https://docs.aws.amazon.com/nova/latest/userguide/speech.html

Add OpenAI gpt4o-transcribe
Please add OpenAI Gpt4o-transcribe https://openai.com/index/introducing-our-next-generation-audio-models/

Here is the API Documentation
https://platform.openai.com/docs/api-reference/audio/createTranscription

Add Wizper V3 from fal.ai
Please add https://fal.ai/models/fal-ai/wizper to the evaluation

API Documentation:
https://fal.ai/models/fal-ai/wizper/api

Add cartesia-ink
Please add https://cartesia.ai/ink to the evaluation

API doc
https://docs.cartesia.ai/2025-04-16/build-with-cartesia/models/stt#ink-whisper

Buttermilk03 changed discussion title from Please add State of the Art proprietary models to Please add State of the Art proprietary models to OpenASR bechmark 10 days ago

Buttermilk03 changed discussion title from Please add State of the Art proprietary models to OpenASR bechmark to Please add State of the Art proprietary models to OpenASR leaderboard 10 days ago

Buttermilk03

5 days ago

@Steveeeeeeen : What do you think, do you have plans to add more models?

There are just many lacking models. Most benchmarks are not independent. OpenASR is a relief here. I found one more benchmark here, https://voicewriter.io/speech-recognition-leaderboard , they also test the Gemini models where they are in the Top 3. The benchmark however is not as comprehensive as the OpenASR.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment