Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
mesolitica
's Collections
Audio Language Model
Malaysian Reasoning
Malaysian Finetuned Instruct LoRA
Malaysian Speech-to-Text
Malaysian Text-to-Speech
Malaysian Translation
Malaysian pretraining dataset
Malaysian instruction dataset
MaLLaM 🌙
Malaysian CausalLM
Malaysian LLM2Vec
Malaysian Seq2Seq
Malaysian MaskLM
Malaysian Reasoning
updated
7 days ago
Full parameter post training using SFT warmup and GRPO.
Upvote
1
mesolitica/Malaysian-Qwen2.5-1.5B-Reasoning-SFT
2B
•
Updated
13 days ago
•
135
mesolitica/Malaysian-Qwen2.5-1.5B-Reasoning-GRPO
2B
•
Updated
13 days ago
•
54
mesolitica/Malaysian-Qwen2.5-7B-Reasoning-SFT
8B
•
Updated
13 days ago
•
1.84k
•
1
mesolitica/Malaysian-Qwen2.5-7B-Dialect-Reasoning-GRPO
8B
•
Updated
27 days ago
•
77
•
3
mesolitica/Malaysian-Qwen2.5-14B-Reasoning-SFT
15B
•
Updated
13 days ago
•
1.87k
mesolitica/Malaysian-Qwen2.5-14B-Reasoning-GRPO
15B
•
Updated
12 days ago
•
63
•
1
Upvote
1
Share collection
View history
Collection guide
Browse collections