BlossomsAI
/

BloomVN-8B-Chat-Reasoning

Text Generation

text-generation-inference

Model card Files Files and versions Community

🌟 BloomVN-8B-Chat-Reasoning

A fine-tuned multilingual model for Vietnamese reasoning

NOTE

This model is a test version for our new training pipeline. THIS IS NOT SUITABLE FOR PRODUCTION
The full model will be updated soon.

📋 Overview

A language model with reasoning capability that provides step-by-step reasoning in Vietnamese before delivering answers.
The model follows a structured XML format with explicit reasoning tags.
It's designed for educational applications and complex problem-solving tasks in Vietnamese.

🔧 Method

Fine-tuned using Group Relative Policy Optimization (GRPO) with Unsloth for hardware efficiency.
Employs rule-based reward functions to encourage adherence to Vietnamese XML reasoning format.
Uses LoRA adaptation on a Vietnamese dataset spanning various task types.

💫 Quantization

Coming Soon!

🤝 Contributors

Developed with ❤️ by BlossomAI

_{Star ⭐️ this repo if you find it valuable!}

Downloads last month: 4

Safetensors

Model size

8.55B params

Tensor type

BF16

·

Inference Providers NEW

Text Generation

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for BlossomsAI/BloomVN-8B-Chat-Reasoning

Base model

Qwen/Qwen2.5-7B

Finetuned

sail/Sailor2-8B

Finetuned

sail/Sailor2-8B-Chat

Finetuned

BlossomsAI/BloomVN-8B-chat

Finetuned

(1)

this model

Quantizations

1 model