Logo

🌟 BloomVN-8B-Chat-Reasoning

A fine-tuned multilingual model for Vietnamese reasoning

NOTE

  • This model is a test version for our new training pipeline. THIS IS NOT SUITABLE FOR PRODUCTION
  • The full model will be updated soon.

πŸ“‹ Overview

  • A language model with reasoning capability that provides step-by-step reasoning in Vietnamese before delivering answers.
  • The model follows a structured XML format with explicit reasoning tags.
  • It's designed for educational applications and complex problem-solving tasks in Vietnamese.

πŸ”§ Method

  • Fine-tuned using Group Relative Policy Optimization (GRPO) with Unsloth for hardware efficiency.
  • Employs rule-based reward functions to encourage adherence to Vietnamese XML reasoning format.
  • Uses LoRA adaptation on a Vietnamese dataset spanning various task types.

πŸ’« Quantization

Coming Soon!

🀝 Contributors

Developed with ❀️ by BlossomAI


Star ⭐️ this repo if you find it valuable!
Downloads last month
4
Safetensors
Model size
8.55B params
Tensor type
BF16
Β·
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for BlossomsAI/BloomVN-8B-Chat-Reasoning

Base model

Qwen/Qwen2.5-7B
Finetuned
sail/Sailor2-8B
Finetuned
(1)
this model
Quantizations
1 model