Dynamic-SUPERB

community

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

juice500 authored a paper 12 days ago

Wav2Gloss: Generating Interlinear Glossed Text from Speech

juice500 authored a paper 12 days ago

TiDAL: Learning Training Dynamics for Active Learning

juice500 authored a paper 12 days ago

On the Effects of Heterogeneous Data Sources on Speech-to-Text Foundation Models

View all activity

speech31

authored 3 papers 12 days ago

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks

Paper • 2411.05361 • Published Nov 8, 2024 • 4

POWSM: A Phonetic Open Whisper-Style Speech Foundation Model

Paper • 2510.24992 • Published Oct 28, 2025 • 4

PRiSM: Benchmarking Phone Realization in Speech Models

Paper • 2601.14046 • Published 13 days ago • 6

kalbin

authored a paper 12 days ago

PRiSM: Benchmarking Phone Realization in Speech Models

Paper • 2601.14046 • Published 13 days ago • 6

kalbin

authored a paper 19 days ago

Towards Comprehensive Semantic Speech Embeddings for Chinese Dialects

Paper • 2601.07274 • Published 21 days ago • 1

dlion168

submitted a paper to Daily Papers 20 days ago

On the Fallacy of Global Token Perplexity in Spoken Language Model Evaluation

Paper • 2601.06329 • Published 23 days ago • 2

kalbin

authored 3 papers about 2 months ago

PWESuite: Phonetic Word Embeddings and Tasks They Facilitate

Paper • 2304.02541 • Published Apr 5, 2023 • 2

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks

Paper • 2411.05361 • Published Nov 8, 2024 • 4

POWSM: A Phonetic Open Whisper-Style Speech Foundation Model

Paper • 2510.24992 • Published Oct 28, 2025 • 4

Steveeeeeeen

authored 2 papers 2 months ago

Treble10: A high-quality dataset for far-field speech recognition, dereverberation, and enhancement

Paper • 2510.23141 • Published Oct 27, 2025 • 5

Open ASR Leaderboard: Towards Reproducible and Transparent Multilingual and Long-Form Speech Recognition Evaluation

Paper • 2510.06961 • Published Oct 8, 2025 • 10

yenting-biao

authored a paper 3 months ago

SAKE: Towards Editing Auditory Attribute Knowledge of Large Audio-Language Models

Paper • 2510.16917 • Published Oct 19, 2025 • 20

zenyn

authored 2 papers 3 months ago

SAKE: Towards Editing Auditory Attribute Knowledge of Large Audio-Language Models

Paper • 2510.16917 • Published Oct 19, 2025 • 20

Investigating Safety Vulnerabilities of Large Audio-Language Models Under Speaker Emotional Variations

Paper • 2510.16893 • Published Oct 19, 2025 • 18

ga642381

authored 3 papers 4 months ago

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks

Paper • 2411.05361 • Published Nov 8, 2024 • 4

DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment

Paper • 2507.02768 • Published Jul 3, 2025 • 18

Game-Time: Evaluating Temporal Dynamics in Spoken Language Models

Paper • 2509.26388 • Published Sep 30, 2025 • 27

WeiChihChen

authored 3 papers 4 months ago

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks

Paper • 2411.05361 • Published Nov 8, 2024 • 4

BreezyVoice: Adapting TTS for Taiwanese Mandarin with Enhanced Polyphone Disambiguation -- Challenges and Insights

Paper • 2501.17790 • Published Jan 29, 2025 • 3

DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment

Paper • 2507.02768 • Published Jul 3, 2025 • 18