Spaces:
Running
Running
arunasrivastava
commited on
Commit
ยท
250567b
1
Parent(s):
e61aa35
added readme
Browse files- README.md +121 -9
- img/leaderboard.png +0 -0
README.md
CHANGED
@@ -1,10 +1,122 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
-
|
3 |
-
|
4 |
-
colorFrom: purple
|
5 |
-
colorTo: pink
|
6 |
-
sdk: gradio
|
7 |
-
sdk_version: 5.8.0
|
8 |
-
app_file: app.py
|
9 |
-
pinned: false
|
10 |
-
---
|
|
|
1 |
+
# ๐ฏ Phonemic Transcription Leaderboard
|
2 |
+
|
3 |
+
Welcome to the Phonemic Transcription Leaderboard! This simple leaderboard helps you track and compare the performance of different speech-to-phoneme model. Feel free to use it for your own hugging face leaderboards!
|
4 |
+
|
5 |
+
## โจ Features
|
6 |
+
|
7 |
+
* ๐ Interactive leaderboard with real-time sorting
|
8 |
+
* ๐ Easy model submission system
|
9 |
+
* ๐ Automatic evaluation of submitted models
|
10 |
+
* ๐ฑ Responsive design that works on all devices
|
11 |
+
|
12 |
+
## ๐ฏ What This Project Does
|
13 |
+
|
14 |
+
This leaderboard tracks two key metrics for phonemic transcription models:
|
15 |
+
|
16 |
+
* **PER (Phoneme Error Rate)**: How accurately your model converts speech to phonemes
|
17 |
+
* **PWED (Phoneme Weighted Edit Distance)**: A more nuanced metric that considers phonemic features
|
18 |
+
|
19 |
+
Models are evaluated on the TIMIT speech corpus, a gold standard in speech recognition research.
|
20 |
+
|
21 |
+
## ๐ Getting Started
|
22 |
+
|
23 |
+
### Prerequisites
|
24 |
+
|
25 |
+
* Python 3.10
|
26 |
+
* Git
|
27 |
+
* A love for speech recognition! ๐ค
|
28 |
+
|
29 |
+
### Quick Installation
|
30 |
+
|
31 |
+
1. Clone this repository:
|
32 |
+
|
33 |
+
```bash
|
34 |
+
git clone [your-repo-url]
|
35 |
+
cd phonemic-leaderboard
|
36 |
+
```
|
37 |
+
|
38 |
+
2. Set up your environment:
|
39 |
+
|
40 |
+
```bash
|
41 |
+
# Create a virtual environment with Python 3.10
|
42 |
+
python3.10 -m venv venv
|
43 |
+
|
44 |
+
# Activate the virtual environment
|
45 |
+
source venv/bin/activate
|
46 |
+
|
47 |
+
# Install the required dependencies
|
48 |
+
pip install -r requirements.txt
|
49 |
+
```
|
50 |
+
|
51 |
+
3. Launch the leaderboard:
|
52 |
+
|
53 |
+
```bash
|
54 |
+
# Run the application
|
55 |
+
uvicorn app:app --host 0.0.0.0 --port 7860
|
56 |
+
```
|
57 |
+
|
58 |
+
4. Visit `http://localhost:7860` in your browser and see the magic! โจ
|
59 |
+
|
60 |
+
## ๐ฎ Using the Leaderboard
|
61 |
+
|
62 |
+
### Submitting a Model
|
63 |
+
|
64 |
+
1. Go to the "Submit Model" tab
|
65 |
+
2. Enter your model details:
|
66 |
+
* Model name (e.g., "wav2vec2-phoneme-wizard")
|
67 |
+
* Submission name (e.g., "MyAwesomeModel v1.0")
|
68 |
+
* GitHub/Kaggle/HuggingFace URL (optional)
|
69 |
+
3. Click Submit and watch your model climb the ranks! ๐
|
70 |
+
|
71 |
+
### Checking Model Status
|
72 |
+
|
73 |
+
1. Navigate to the "Model Status" tab
|
74 |
+
2. Enter your model name or task ID
|
75 |
+
3. Get real-time updates on your model's evaluation progress
|
76 |
+
|
77 |
+
## ๐ Understanding the Results
|
78 |
+
|
79 |
+
The leaderboard shows:
|
80 |
+
|
81 |
+
* Model names and submission details
|
82 |
+
* PER and PWED scores (lower is better!)
|
83 |
+
* Links to model repositories
|
84 |
+
* Submission dates
|
85 |
+
|
86 |
+
Sort by either metric to see who's leading the pack!
|
87 |
+
|
88 |
+
## ๐ ๏ธ Technical Details
|
89 |
+
|
90 |
+
* Built with Gradio for a smooth UI experience
|
91 |
+
* Runs on a basic compute plan (16GB RAM, 2vCPUs) for easy reproducibility
|
92 |
+
* Evaluation can take several hours - perfect time to grab a coffee โ
|
93 |
+
|
94 |
+
## ๐ค Contributing
|
95 |
+
|
96 |
+
Want to make this leaderboard even better? We'd love your help! Here are some ways you can contribute:
|
97 |
+
|
98 |
+
* Add new evaluation metrics
|
99 |
+
* Improve the UI design
|
100 |
+
* Enhance documentation
|
101 |
+
* Submit bug fixes
|
102 |
+
* Add new features
|
103 |
+
|
104 |
+
## ๐ License
|
105 |
+
|
106 |
+
This project is licensed under the MIT License - see the LICENSE file for details.
|
107 |
+
|
108 |
+
## ๐ Acknowledgments
|
109 |
+
|
110 |
+
* Thanks to the TIMIT speech corpus for providing evaluation data
|
111 |
+
* Shoutout to the [panphon library](https://github.com/dmort27/panphon) for PWED calculations
|
112 |
+
* Built with love by Koel Labs ๐
|
113 |
+
|
114 |
+
## ๐ Need Help?
|
115 |
+
|
116 |
+
Got questions? Found a bug? Want to contribute? Open an issue or reach out to us! We're here to help make speech recognition evaluation fun and accessible for everyone!
|
117 |
+
|
118 |
+
Remember: Every great model deserves its moment to shine! ๐
|
119 |
+
|
120 |
---
|
121 |
+
|
122 |
+
Happy Transcribing! ๐คโจ
|
|
|
|
|
|
|
|
|
|
|
|
|
|
img/leaderboard.png
ADDED