MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization Paper • 2507.14683 • Published Jul 19 • 126
Ditto: A Simple and Efficient Approach to Improve Sentence Embeddings Paper • 2305.10786 • Published May 18, 2023
MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation Paper • 2312.11825 • Published Dec 19, 2023
MinMo: A Multimodal Large Language Model for Seamless Voice Interaction Paper • 2501.06282 • Published Jan 10 • 54
HiFi-SR: A Unified Generative Transformer-Convolutional Adversarial Network for High-Fidelity Speech Super-Resolution Paper • 2501.10045 • Published Jan 17 • 9
InspireMusic: Integrating Super Resolution and Large Language Model for High-Fidelity Long-Form Music Generation Paper • 2503.00084 • Published Feb 28
MERaLiON-TextLLM: Cross-Lingual Understanding of Large Language Models in Chinese, Indonesian, Malay, and Singlish Paper • 2501.08335 • Published Dec 21, 2024
Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia Paper • 2503.07920 • Published Mar 10 • 100
CRAFT: Extracting and Tuning Cultural Instructions from the Wild Paper • 2405.03138 • Published May 6, 2024 • 1
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages Paper • 2406.10118 • Published Jun 14, 2024 • 33
AudioBench: A Universal Benchmark for Audio Large Language Models Paper • 2406.16020 • Published Jun 23, 2024
Evaluating Word Embedding Models: Methods and Experimental Results Paper • 1901.09785 • Published Jan 28, 2019
CoinMath: Harnessing the Power of Coding Instruction for Math LLMs Paper • 2412.11699 • Published Dec 16, 2024 • 1