Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
mesolitica
's Collections
Speech Tokenizer
Audio Language Model
Malaysian Reasoning
Malaysian Finetuned Instruct LoRA
Malaysian Speech-to-Text
Malaysian Text-to-Speech
Malaysian Translation
Malaysian pretraining dataset
Malaysian instruction dataset
MaLLaM 🌙
Malaysian CausalLM
Malaysian LLM2Vec
Malaysian Seq2Seq
Malaysian MaskLM
Malaysian Reasoning
updated
Jun 24
Full parameter post training using SFT warmup and GRPO.
Upvote
1
mesolitica/Malaysian-Qwen2.5-1.5B-Reasoning-SFT
2B
•
Updated
Jun 18
•
4
mesolitica/Malaysian-Qwen2.5-1.5B-Reasoning-GRPO
2B
•
Updated
Jun 18
•
23
mesolitica/Malaysian-Qwen2.5-7B-Reasoning-SFT
8B
•
Updated
Jun 18
•
372
•
1
mesolitica/Malaysian-Qwen2.5-7B-Dialect-Reasoning-GRPO
8B
•
Updated
Jun 4
•
2
•
3
mesolitica/Malaysian-Qwen2.5-14B-Reasoning-SFT
15B
•
Updated
Jun 18
•
327
mesolitica/Malaysian-Qwen2.5-14B-Reasoning-GRPO
15B
•
Updated
Jun 18
•
9
•
1
Upvote
1
Share collection
View history
Collection guide
Browse collections