Nguyen Van Anh Tuan's picture

Nguyen Van Anh Tuan

tuanio

·

https://tuanio.github.io/

tuanio

AI & ML interests

Natural Language Processing and Speech Processing

Recent Activity

liked a model 1 day ago

MERaLiON/MERaLiON-SpeechEncoder-2

liked a model 9 days ago

bosonai/higgs-audio-v2-generation-3B-base

liked a model 11 days ago

vidore/colqwen-omni-v0.1

View all activity

Organizations

upvoted a collection 11 days ago

SSMs

A collection of Mamba-2-based research models with 8B parameters trained on 3.5T tokens for comparison with Transformers. • 5 items • Updated 11 days ago • 28

upvoted a collection about 2 months ago

MERaLiON-2

3 items • Updated May 28 • 2

upvoted a collection 4 months ago

Vietnamese speech dataset

for any speech-related tasks including but not limited to: speech-to-text & text-to-speech, speech classification, speaker verification, etc. • 34 items • Updated 24 days ago • 26

upvoted a paper 4 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 373

upvoted 2 collections 4 months ago

VoxPopuli v2

A collection of checkpoints from the second VoxPopuli release. • 35 items • Updated Jan 16, 2024 • 6

Speech-to-Text Translation

5 items • Updated Sep 27, 2024 • 1

upvoted a paper 4 months ago

Whisper-LM: Improving ASR Models with Language Models for Low-Resource Languages

Paper • 2503.23542 • Published Mar 30 • 10

upvoted a collection 4 months ago

Qwen2.5-Omni

End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5 • 7 items • Updated 11 days ago • 155

upvoted a collection 5 months ago

Model Merging

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 244

upvoted a collection 8 months ago

WhisperLah

A collection of Whisper-variants for Singapore languages, e.g. English, Mandarin, Bahasa Malaysia, Tamil • 3 items • Updated Nov 27, 2024 • 1

upvoted a collection 9 months ago

Whisper pruned

Pruned / trimmed versions of whisper models with unnecessary languages removed. • 5 items • Updated Jan 30 • 1

upvoted a collection 10 months ago

distil-large-v3

This collection contains the model repositories for distil-large-v3, which provides support for the most popular Whisper libraries. • 4 items • Updated Mar 21, 2024 • 6

upvoted a collection 11 months ago

Qwen2

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated 11 days ago • 368

upvoted a collection 12 months ago

MaLLaM 🌙

Pretrain from scratch 4096 context length on 90B tokens Malaysian text, https://huggingface.co/papers/2401.14680 • 10 items • Updated Jun 24 • 15

upvoted 2 collections about 1 year ago

MoE-LLaVA Model

9 items • Updated Jun 25 • 11

VinaLLaMA

Second Generation, Most Powerful Open-Source Vietnamese LLMs. • 8 items • Updated Feb 9, 2024 • 13

upvoted an article over 1 year ago

Article

The Annotated Diffusion Model

By

and 1 other •

Jun 7, 2022

• 250