arunasrivastava commited on
Commit
250567b
ยท
1 Parent(s): e61aa35

added readme

Browse files
Files changed (2) hide show
  1. README.md +121 -9
  2. img/leaderboard.png +0 -0
README.md CHANGED
@@ -1,10 +1,122 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
- title: IPA Transcription Leaderboard
3
- emoji: ๐Ÿ“
4
- colorFrom: purple
5
- colorTo: pink
6
- sdk: gradio
7
- sdk_version: 5.8.0
8
- app_file: app.py
9
- pinned: false
10
- ---
 
1
+ # ๐ŸŽฏ Phonemic Transcription Leaderboard
2
+
3
+ Welcome to the Phonemic Transcription Leaderboard! This simple leaderboard helps you track and compare the performance of different speech-to-phoneme model. Feel free to use it for your own hugging face leaderboards!
4
+
5
+ ## โœจ Features
6
+
7
+ * ๐Ÿ“Š Interactive leaderboard with real-time sorting
8
+ * ๐Ÿ”„ Easy model submission system
9
+ * ๐Ÿ“ˆ Automatic evaluation of submitted models
10
+ * ๐Ÿ“ฑ Responsive design that works on all devices
11
+
12
+ ## ๐ŸŽฏ What This Project Does
13
+
14
+ This leaderboard tracks two key metrics for phonemic transcription models:
15
+
16
+ * **PER (Phoneme Error Rate)**: How accurately your model converts speech to phonemes
17
+ * **PWED (Phoneme Weighted Edit Distance)**: A more nuanced metric that considers phonemic features
18
+
19
+ Models are evaluated on the TIMIT speech corpus, a gold standard in speech recognition research.
20
+
21
+ ## ๐Ÿš€ Getting Started
22
+
23
+ ### Prerequisites
24
+
25
+ * Python 3.10
26
+ * Git
27
+ * A love for speech recognition! ๐ŸŽค
28
+
29
+ ### Quick Installation
30
+
31
+ 1. Clone this repository:
32
+
33
+ ```bash
34
+ git clone [your-repo-url]
35
+ cd phonemic-leaderboard
36
+ ```
37
+
38
+ 2. Set up your environment:
39
+
40
+ ```bash
41
+ # Create a virtual environment with Python 3.10
42
+ python3.10 -m venv venv
43
+
44
+ # Activate the virtual environment
45
+ source venv/bin/activate
46
+
47
+ # Install the required dependencies
48
+ pip install -r requirements.txt
49
+ ```
50
+
51
+ 3. Launch the leaderboard:
52
+
53
+ ```bash
54
+ # Run the application
55
+ uvicorn app:app --host 0.0.0.0 --port 7860
56
+ ```
57
+
58
+ 4. Visit `http://localhost:7860` in your browser and see the magic! โœจ
59
+
60
+ ## ๐ŸŽฎ Using the Leaderboard
61
+
62
+ ### Submitting a Model
63
+
64
+ 1. Go to the "Submit Model" tab
65
+ 2. Enter your model details:
66
+ * Model name (e.g., "wav2vec2-phoneme-wizard")
67
+ * Submission name (e.g., "MyAwesomeModel v1.0")
68
+ * GitHub/Kaggle/HuggingFace URL (optional)
69
+ 3. Click Submit and watch your model climb the ranks! ๐Ÿš€
70
+
71
+ ### Checking Model Status
72
+
73
+ 1. Navigate to the "Model Status" tab
74
+ 2. Enter your model name or task ID
75
+ 3. Get real-time updates on your model's evaluation progress
76
+
77
+ ## ๐Ÿ“Š Understanding the Results
78
+
79
+ The leaderboard shows:
80
+
81
+ * Model names and submission details
82
+ * PER and PWED scores (lower is better!)
83
+ * Links to model repositories
84
+ * Submission dates
85
+
86
+ Sort by either metric to see who's leading the pack!
87
+
88
+ ## ๐Ÿ› ๏ธ Technical Details
89
+
90
+ * Built with Gradio for a smooth UI experience
91
+ * Runs on a basic compute plan (16GB RAM, 2vCPUs) for easy reproducibility
92
+ * Evaluation can take several hours - perfect time to grab a coffee โ˜•
93
+
94
+ ## ๐Ÿค Contributing
95
+
96
+ Want to make this leaderboard even better? We'd love your help! Here are some ways you can contribute:
97
+
98
+ * Add new evaluation metrics
99
+ * Improve the UI design
100
+ * Enhance documentation
101
+ * Submit bug fixes
102
+ * Add new features
103
+
104
+ ## ๐Ÿ“ License
105
+
106
+ This project is licensed under the MIT License - see the LICENSE file for details.
107
+
108
+ ## ๐ŸŒŸ Acknowledgments
109
+
110
+ * Thanks to the TIMIT speech corpus for providing evaluation data
111
+ * Shoutout to the [panphon library](https://github.com/dmort27/panphon) for PWED calculations
112
+ * Built with love by Koel Labs ๐Ÿ’™
113
+
114
+ ## ๐Ÿ†˜ Need Help?
115
+
116
+ Got questions? Found a bug? Want to contribute? Open an issue or reach out to us! We're here to help make speech recognition evaluation fun and accessible for everyone!
117
+
118
+ Remember: Every great model deserves its moment to shine! ๐ŸŒŸ
119
+
120
  ---
121
+
122
+ Happy Transcribing! ๐ŸŽคโœจ
 
 
 
 
 
 
 
img/leaderboard.png ADDED