Update README.md
Browse files
README.md
CHANGED
@@ -14,4 +14,8 @@ tags:
|
|
14 |
( _ \( ___)( ) ( ) /__\ (_ _)( _ \(_ _)( \/ )
|
15 |
) _ < )__) )(__ )(__ /(__)\ )( ) / _)(_ ) (
|
16 |
(____/(____)(____)(____)(__)(__)(__) (_)\_)(____)(_/\_)
|
17 |
-
</pre>
|
|
|
|
|
|
|
|
|
|
14 |
( _ \( ___)( ) ( ) /__\ (_ _)( _ \(_ _)( \/ )
|
15 |
) _ < )__) )(__ )(__ /(__)\ )( ) / _)(_ ) (
|
16 |
(____/(____)(____)(____)(__)(__)(__) (_)\_)(____)(_/\_)
|
17 |
+
</pre>
|
18 |
+
|
19 |
+
# **Bellatrix-Tiny-1B-v2**
|
20 |
+
|
21 |
+
Bellatrix is based on a reasoning-based model designed for the QWQ synthetic dataset entries. The pipeline's instruction-tuned, text-only models are optimized for multilingual dialogue use cases, including agentic retrieval and summarization tasks. These models outperform many of the available open-source options. Bellatrix is an auto-regressive language model that uses an optimized transformer architecture. The tuned versions utilize supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF).
|