Quan

wq2012

https://wangquan.me/

wq2012's activity

liked a model about 1 month ago

google/gemma-3n-E4B-it-litert-preview

Image-Text-to-Text • Updated 25 days ago • 1.16k

upvoted a paper 4 months ago

OctoTools: An Agentic Framework with Extensible Tools for Complex Reasoning

Paper • 2502.11271 • Published Feb 16 • 18

authored 2 papers 6 months ago

Improved Long-Form Speech Recognition by Jointly Modeling the Primary and Non-primary Speakers

Paper • 2312.11123 • Published Dec 18, 2023

CVSS Corpus and Massively Multilingual Speech-to-Speech Translation

Paper • 2201.03713 • Published Jan 11, 2022

updated 2 models 9 months ago

tflite-hub/conformer-lang-id

Updated Sep 19, 2024 • 38

tflite-hub/conformer-speaker-encoder

Updated Sep 19, 2024 • 86 • 5

commented a paper 9 months ago

DiarizationLM: Speaker Diarization Post-Processing with Large Language Models

Paper • 2401.03506 • Published Jan 7, 2024 • 14

updated a collection 9 months ago

DiarizationLM

Collection

5 items • Updated Sep 19, 2024

updated 3 Spaces 9 months ago

authored 2 papers 9 months ago

SpeakerStew: Scaling to Many Languages with a Triaged Multilingual Text-Dependent and Text-Independent Speaker Verification System

Paper • 2104.02125 • Published Apr 5, 2021

Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech

Paper • 2202.12163 • Published Feb 24, 2022

liked a Space 9 months ago

880

Open ASR Leaderboard

🏆

Request evaluation for a speech model

updated a model 10 months ago

google/DiarizationLM-13b-Fisher-v1

Text Generation • Updated Aug 11, 2024 • 129 • 11

updated a Space 11 months ago

DiarizationLM GGUF

💬

Generate detailed speaker diarization from text input

updated a model 11 months ago

google/DiarizationLM-8b-Fisher-v2

Updated Aug 2, 2024 • 4.23k • 30

updated a collection 11 months ago

DiarizationLM

Collection

5 items • Updated Sep 19, 2024

updated a model 11 months ago

google/DiarizationLM-8b-Fisher-v1

Text Generation • Updated Aug 2, 2024 • 139 • 3

upvoted a paper 11 months ago

DiarizationLM: Speaker Diarization Post-Processing with Large Language Models

Paper • 2401.03506 • Published Jan 7, 2024 • 14