File size: 3,437 Bytes
381f539
c441a7f
 
 
 
 
 
38024bc
381f539
38024bc
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
---
title: IPA Transcription Leaderboard
emoji: ๐Ÿ“
colorFrom: purple
colorTo: pink
sdk: gradio
sdk_version: 5.8.0
app_file: app/app.py
pinned: false
---

# ๐ŸŽฏ Phonemic Transcription Leaderboard

Welcome to the Phonemic Transcription Leaderboard! This simple leaderboard helps track and compare the performance of different speech-to-phoneme models. Feel free to fork it for your own hugging face leaderboards! 

![leaderboard](img/leaderboard.png)

## โœจ Features

* ๐Ÿ“Š Interactive leaderboard with real-time sorting
* ๐Ÿ”„ Easy model submission system
* ๐Ÿ“ˆ Automatic evaluation of submitted models 
* ๐Ÿ“ฑ Responsive design that works on all devices

## ๐ŸŽฏ What This Project Does

This leaderboard tracks two key metrics for phonemic transcription models:


* **PER (Phoneme Error Rate)**: How accurately your model converts speech to phonemes
* **PWED (Phoneme Weighted Edit Distance)**: A more nuanced metric that considers phonemic features

Read more about evaluations on our [blog](https://www.koellabs.com/blog/phonemic-transcription-metrics)

Models are evaluated on the TIMIT speech corpus, a gold standard in speech recognition research.

## ๐Ÿš€ Getting Started

Navigate to the hosted version on [Hugging Face](https://huggingface.co/spaces/KoelLabs/IPA-Transcription-EN) or follow the instructions in [DEVELOPMENT.md](DEVELOPMENT.md) to run the leaderboard locally.

## ๐ŸŽฎ Using the Leaderboard

### Submitting a Model

1. Go to the "Submit Model" tab
2. Enter your model details:
   * Model name (e.g., "wav2vec2-phoneme-wizard")
   * Submission name (e.g., "MyAwesomeModel v1.0")
   * GitHub/Kaggle/HuggingFace URL (optional)
3. Click Submit and watch your model climb the ranks! ๐Ÿš€

### Checking Model Status

1. Navigate to the "Model Status" tab
2. Enter your model name or task ID
3. Get real-time updates on your model's evaluation progress

## ๐Ÿ“Š Understanding the Results

The leaderboard shows:

* Model names and submission details
* PER and PWED scores (lower is better!)
* Links to model repositories
* Submission dates

Sort by either metric to see who's leading the pack!

## ๐Ÿ› ๏ธ Technical Details

* Built with Gradio for a smooth UI experience
* Runs on a basic compute plan (16GB RAM, 2vCPUs) for easy reproducibility
* Evaluation can take several hours - perfect time to grab a coffee โ˜•

## ๐Ÿค Contributing

Want to make this leaderboard even better? We'd love your help! Here are some ways you can contribute:

* Add new evaluation metrics
* Improve the UI design
* Enhance documentation
* Submit bug fixes
* Add new features

Checkout the [CONTRIBUTING.md](CONTRIBUTING.md) for more details.

## ๐Ÿ“ License

This project is licensed under the GNU Affero General Public License.

We retain all rights to the Koel Labs brand, logos, blog posts and website content.

## ๐ŸŒŸ Acknowledgments

* Thanks to the TIMIT speech corpus for providing evaluation data
* Shoutout to the [panphon library](https://github.com/dmort27/panphon) for PWED calculations
* Built with love by Koel Labs ๐Ÿ’™

## ๐Ÿ†˜ Need Help?

Got questions? Found a bug? Want to contribute? [Open an issue](https://huggingface.co/spaces/KoelLabs/IPA-Transcription-EN/discussions) or [reach out to us](mailto:[email protected])! We're here to help make speech recognition evaluation fun and accessible for everyone!

Remember: Every great model deserves its moment to shine! ๐ŸŒŸ

---

Happy Transcribing! ๐ŸŽคโœจ