Update README.md
README.md

This model is a fine-tuned version of [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct).
The idea behind this model is to train on a dataset derived from a smaller subset of the [tagengo-gpt4](https://huggingface.co/datasets/lightblue/tagengo-gpt4) dataset, but with improved data quality.

I tried to achieve higher data quality by prompting GPT-4o, OpenAI's latest LLM, which has better multilingual capabilities. The training objective is primarily focused on the Russian language (80% of the training examples).
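
To make the data-quality idea concrete, here is a hedged sketch of what such a regeneration pass could look like. This is an illustration, not my actual pipeline, and the tagengo-gpt4 column names (`language`, `conversations`) should be checked against the dataset card:

```python
# Hedged sketch of regenerating tagengo-gpt4 answers with GPT-4o -- illustrative
# only, not the actual training-data pipeline. Assumes the dataset exposes
# "language" and "conversations" columns; verify against the dataset card.
from datasets import load_dataset
from openai import OpenAI

client = OpenAI()  # expects OPENAI_API_KEY in the environment
tagengo = load_dataset("lightblue/tagengo-gpt4", split="train")

# Bias the subset toward Russian (~80% of examples), per the training objective.
russian = tagengo.filter(lambda row: row["language"] == "Russian")

def regenerate(prompt: str) -> str:
    """Re-answer an existing tagengo prompt with GPT-4o for a higher-quality target."""
    response = client.chat.completions.create(
        model="gpt-4o",
        messages=[{"role": "user", "content": prompt}],
    )
    return response.choices[0].message.content

# Regenerate the response for the first Russian prompt as a smoke test.
first_prompt = russian[0]["conversations"][0]["value"]
print(regenerate(first_prompt))
```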
The model shows promising results on the MT-Bench evaluation benchmark, surpassing GPT-3.5-turbo and being on par with [Suzume](https://huggingface.co/lightblue/suzume-llama-3-8B-multilingual) in Russian language scores, even though the latter is trained on an 8x bigger and more diverse dataset.
## How to use

The easiest way to run this model on your own computer is to use the GGUF version ([ruslandev/llama-3-8b-gpt-4o-ru1.0-gguf](https://huggingface.co/ruslandev/llama-3-8b-gpt-4o-ru1.0-gguf)) with a program such as [llama.cpp](https://github.com/ggerganov/llama.cpp).
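
If you would rather call the GGUF weights from Python instead of the llama.cpp CLI, a minimal sketch using the [llama-cpp-python](https://github.com/abetlen/llama-cpp-python) bindings could look like this. The quant filename and the ChatML chat format are my assumptions (the `--chatml` flag in the gptchain command below hints at the latter), so adjust both to what the GGUF repo actually ships:

```python
# Minimal llama-cpp-python sketch (pip install llama-cpp-python huggingface_hub).
# Assumptions: the repo publishes a Q4_K_M quant, and the model expects ChatML
# formatting -- neither is documented here, so adjust to the actual files.
from llama_cpp import Llama

llm = Llama.from_pretrained(
    repo_id="ruslandev/llama-3-8b-gpt-4o-ru1.0-gguf",
    filename="*Q4_K_M.gguf",  # hypothetical quant name; match a real file in the repo
    chat_format="chatml",     # assumption, based on the --chatml flag used below
    n_ctx=8192,
)

response = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Из чего состоит нейронная сеть?"}],
    max_tokens=512,
)
print(response["choices"][0]["message"]["content"])
```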

If you want to use this model directly with the Hugging Face Transformers stack, I recommend using my framework [gptchain](https://github.com/RuslanPeresy/gptchain):
```
git clone https://github.com/RuslanPeresy/gptchain.git
cd gptchain
pip install -r requirements-train.txt

# The -q payload asks, in Russian: "What does a neural network consist of?"
python gptchain.py chat -m ruslandev/llama-3-8b-gpt-4o-ru1.0-gguf \
    --chatml true \
    -q '[{"from": "human", "value": "Из чего состоит нейронная сеть?"}]'
```
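
If you prefer plain Transformers without gptchain, a sketch along these lines should also work. It is not the documented workflow for this model, and it assumes the repo's tokenizer ships a suitable chat template:

```python
# Plain Hugging Face Transformers sketch -- an alternative to gptchain, not the
# documented workflow. Assumes the model repo's tokenizer provides a chat
# template (the --chatml flag above suggests ChatML).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "ruslandev/llama-3-8b-gpt-4o-ru1.0"  # full-precision repo, not the GGUF one

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # ~16 GB of VRAM in bf16; quantize if you have less
    device_map="auto",
)

# "What does a neural network consist of?" in Russian.
messages = [{"role": "user", "content": "Из чего состоит нейронная сеть?"}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=512, do_sample=True, temperature=0.7)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```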

## Evaluation scores

I achieved the following scores on Ru/En MT-Bench:

| Language | meta-llama/Meta-Llama-3-8B-Instruct | ruslandev/llama-3-8b-gpt-4o-ru1.0 | lightblue/suzume-llama-3-8B-multilingual | Nexusflow/Starling-LM-7B-beta | gpt-3.5-turbo |
|:----------:|:----------------------------------:|:---------------------------------:|:----------------------------------------:|:-----------------------------:|:-------------:|
| Russian 🇷🇺 | NaN | 8.12 | 8.19 | 8.06 | 7.94 |