L1-aware Multilingual Mispronunciation Detection Framework Paper β’ 2309.07719 β’ Published Sep 14, 2023
SpeechBlender: Speech Augmentation Framework for Mispronunciation Data Generation Paper β’ 2211.00923 β’ Published Nov 2, 2022
BiCrossMamba-ST: Speech Deepfake Detection with Bidirectional Mamba Spectro-Temporal Cross-Attention Paper β’ 2505.13930 β’ Published 27 days ago
Comprehensive Layer-wise Analysis of SSL Models for Audio Deepfake Detection Paper β’ 2502.03559 β’ Published Feb 5
Beyond Orthography: Automatic Recovery of Short Vowels and Dialectal Sounds in Arabic Paper β’ 2408.02430 β’ Published Aug 5, 2024
Speech Representation Analysis based on Inter- and Intra-Model Similarities Paper β’ 2406.16099 β’ Published Jun 23, 2024
The complementary roles of non-verbal cues for Robust Pronunciation Assessment Paper β’ 2309.07739 β’ Published Sep 14, 2023
Multi-View Multi-Task Representation Learning for Mispronunciation Detection Paper β’ 2306.01845 β’ Published Jun 2, 2023
MyVoice: Arabic Speech Resource Collaboration Platform Paper β’ 2308.02503 β’ Published Jul 23, 2023
QVoice: Arabic Speech Pronunciation Learning Application Paper β’ 2305.07445 β’ Published May 9, 2023
Towards a Unified Benchmark for Arabic Pronunciation Assessment: Quranic Recitation as Case Study Paper β’ 2506.07722 β’ Published 7 days ago
Llama-3-Nanda-10B-Chat: An Open Generative Large Language Model for Hindi Paper β’ 2504.06011 β’ Published Apr 8 β’ 1
TechniqueRAG: Retrieval Augmented Generation for Adversarial Technique Annotation in Cyber Threat Intelligence Text Paper β’ 2505.11988 β’ Published 30 days ago β’ 2
view post Post 635 Great efforts from @AtlasIA folks to adapt text2image models (ghibli style) for Moroccan ContextRead the blog is here : https://huggingface.co/blog/atlasia/creating-your-custom-ghibli-text-to-image-model See translation π 1 1 + Reply
SmolVLM: Redefining small and efficient multimodal models Paper β’ 2504.05299 β’ Published Apr 7 β’ 189