AudioLLMs

non-profit

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

binwang updated a dataset 8 days ago

AudioLLMs/MMAU-mini-do-not-use

binwang authored a paper about 1 month ago

MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization

binwang new activity 4 months ago

AudioLLMs/spoken_squad_test:License for Spoken-SQuAD

View all activity

binwang

updated a dataset 8 days ago

AudioLLMs/MMAU-mini-do-not-use

Viewer • Updated 8 days ago • 1k • 126 • 2

binwang

authored a paper about 1 month ago

MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization

Paper • 2507.14683 • Published Jul 19 • 126

binwang

in AudioLLMs/spoken_squad_test 4 months ago

License for Spoken-SQuAD

#1 opened 4 months ago by

michaellee886

binwang

updated a dataset 4 months ago

AudioLLMs/spoken_squad_test

Viewer • Updated May 6 • 5.35k • 154

binwang

updated a dataset 5 months ago

AudioLLMs/Multitask-National-Speech-Corpus-v1-extend

Viewer • Updated Mar 31 • 15.2M • 7.86k • 1

binwang

updated a Space 5 months ago

Leaderboard / AudioBench

🥇

Explore various audio processing features

iris2c

authored 5 papers 5 months ago

Ditto: A Simple and Efficient Approach to Improve Sentence Embeddings

Paper • 2305.10786 • Published May 18, 2023

MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation

Paper • 2312.11825 • Published Dec 19, 2023

MinMo: A Multimodal Large Language Model for Seamless Voice Interaction

Paper • 2501.06282 • Published Jan 10 • 54

HiFi-SR: A Unified Generative Transformer-Convolutional Adversarial Network for High-Fidelity Speech Super-Resolution

Paper • 2501.10045 • Published Jan 17 • 9

InspireMusic: Integrating Super Resolution and Large Language Model for High-Fidelity Long-Form Music Generation

Paper • 2503.00084 • Published Feb 28

binwang

authored 2 papers 6 months ago

MERaLiON-TextLLM: Cross-Lingual Understanding of Large Language Models in Chinese, Indonesian, Malay, and Singlish

Paper • 2501.08335 • Published Dec 21, 2024

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

Paper • 2503.07920 • Published Mar 10 • 100

binwang

authored 7 papers 8 months ago

Knowledge Graph Embedding: An Overview

Paper • 2309.12501 • Published Sep 21, 2023

CRAFT: Extracting and Tuning Cultural Instructions from the Wild

Paper • 2405.03138 • Published May 6, 2024 • 1

SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages

Paper • 2406.10118 • Published Jun 14, 2024 • 33

AI & ML interests

Recent Activity

Team members 8

AudioLLMs's activity

License for Spoken-SQuAD

Leaderboard / AudioBench