OwenArli commited on
Commit
f186021
1 Parent(s): 3f0780e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +45 -3
README.md CHANGED
@@ -1,3 +1,45 @@
1
- ---
2
- license: llama3.1
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: llama3.1
3
+ ---
4
+
5
+ # Llama-3.1-8B-ArliAI-RPMax-v1.1
6
+ =====================================
7
+
8
+ ## Overview
9
+
10
+ This repository is based on the Meta-Llama-3.1-8B-Instruct model and is governed by the Meta Llama 3.1 License agreement: https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct
11
+
12
+ ## Model Description
13
+
14
+ Llama-3.1-8B-ArliAI-RPMax-v1.1 is a variant of the Meta-Llama-3.1-8B model, trained on a diverse set of curated RP datasets with a focus on variety and deduplication. This model is designed to be highly creative and non-repetitive, with a unique approach to training that minimizes repetition.
15
+
16
+ ### Training Details
17
+
18
+ * **Sequence Length**: 8192
19
+ * **Training Duration**: Approximately 1 day on 2x3090Ti
20
+ * **Epochs**: 1 epoch training for minimized repetition sickness
21
+ * **LORA**: 64-rank 128-alpha, resulting in ~2% trainable weights
22
+
23
+ ## Quantization
24
+
25
+ The model is available in two quantized formats:
26
+
27
+ * **FP16**: https://huggingface.co/ArliAI/Llama-3.1-8B-ArliAI-Formax-v1.1
28
+ * **GGUF**: https://huggingface.co/ArliAI/Llama-3.1-8B-ArliAI-Formax-v1.1-GGUF
29
+
30
+ ## Suggested Prompt Format
31
+
32
+ Llama 3 Instruct Format
33
+
34
+ Example:
35
+ ```
36
+ <|begin_of_text|><|start_header_id|>system<|end_header_id|>
37
+
38
+ You are [character]. You have a personality of [personality description]. [Describe scenario]<|eot_id|><|start_header_id|>user<|end_header_id|>
39
+
40
+ {{ user_message_1 }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
41
+
42
+ {{ model_answer_1 }}<|eot_id|><|start_header_id|>user<|end_header_id|>
43
+
44
+ {{ user_message_2 }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
45
+ ```