Upload 4 files
- .gitattributes +2 -0
- GCED3P230E6T114CS8HVEWR0V0.jpeg +3 -0
- README.md +74 -0
- character card/Wings%20of%20Fire_1.json +0 -0
- character card/Wings%20of%20Fire_1.png +3 -0
.gitattributes
CHANGED
@@ -34,3 +34,5 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
36 |
tokenizer.json filter=lfs diff=lfs merge=lfs -text
37 |
+
character[[:space:]]card/Wings%20of%20Fire_1.png filter=lfs diff=lfs merge=lfs -text
|
38 |
+
GCED3P230E6T114CS8HVEWR0V0.jpeg filter=lfs diff=lfs merge=lfs -text
|
GCED3P230E6T114CS8HVEWR0V0.jpeg
ADDED
README.md
ADDED
@@ -0,0 +1,74 @@
1 |
+
---
|
2 |
+
license: llama3.3
|
3 |
+
base_model: meta-llama/Llama-3.3-70B-Instruct
|
4 |
+
tags:
|
5 |
+
- llama-3.3
|
6 |
+
- finetune
|
7 |
+
- roleplay
|
8 |
+
- chat
|
9 |
+
- wings-of-fire
|
10 |
+
datasets:
|
11 |
+
- Darkhn/WOF_QA_V2
|
12 |
+
- Darkhn/WOF_Pretraining
|
13 |
+
- Darkhn/WOF_V3_Combined_Dataset
|
14 |
+
---
|
15 |
+
|
16 |
+
# Model Name - L3.3-70B-Animus-V2
|
17 |
+
|
18 |
+
<img src="GCED3P230E6T114CS8HVEWR0V0.jpeg" alt="Wings_of_Fire" width="700"/>
|
19 |
+
|
20 |
+
## Character Card & Lore Book
|
21 |
+
|
22 |
+
For the best roleplaying experience, it is highly recommended to use the provided character card and lore book. These files help guide the model's persona and provide rich, in-universe context.
|
23 |
+
|
24 |
+
[**Download the Character Card and Lore Book here**](https://huggingface.co/Darkhn/L3.3-70B-Animus-V2/tree/main/character%20card)
|
25 |
+
|
26 |
+
## Model Description
|
27 |
+
|
28 |
+
This is a fine-tuned version of `meta-llama/Llama-3.3-70B-Instruct` specialized for roleplaying and instruction-following within the *Wings of Fire* universe. This version represents a significant upgrade in data quality, roleplaying capability, and base model architecture.
|
29 |
+
|
30 |
+
The model was first adapted on a 3-million-token dataset extracted from the *Wings of Fire* book series to build a strong foundation of domain knowledge. It was then fine-tuned for 1 epoch on an expanded and cleaned dataset of conversational and roleplay examples.
|
31 |
+
|
32 |
+
The goal of this model is to provide a high-quality, immersive, and lore-accurate conversational experience. It can adopt character personas, answer questions about the world, engage in creative storytelling, portray multiple characters at once, and handle more mature themes from the series.
|
33 |
+
|
34 |
+
## Training Details
|
35 |
+
|
36 |
+
### Training Hardware
|
37 |
+
The model was fine-tuned on a single NVIDIA H100 GPU.
|
38 |
+
|
39 |
+
### Training Procedure
|
40 |
+
QLoRA (Quantized Low-Rank Adaptation) was used for efficient fine-tuning, with the training run configured in Axolotl.
|
41 |
+
|
42 |
+
### Training Data
|
43 |
+
The training process involved two main stages:
|
44 |
+
|
45 |
+
1. **Domain Adaptation (Pre-training):** The base model was adapted to the *Wings of Fire* universe using the `Darkhn/WOF_Pretraining` dataset, containing **3 million tokens** compiled directly from the book series. This step saturated the model with the specific lore, characters, and writing style of the source material.
|
46 |
+
|
47 |
+
2. **Instruction & Chat Fine-tuning:** The model was fine-tuned for **1 epoch** on a mixed dataset of **5,000 examples**:
|
48 |
+
* **Roleplay Scenarios (4,200 examples):** From `Darkhn/WOF_V3_Combined_Dataset`. This new dataset features high-quality, multi-turn roleplay. It was specifically curated to teach the model advanced skills like portraying **multiple characters simultaneously** and handling the **more mature or 'darker' themes** (approx. 30% of examples) present in the book series. The data was cleaned to remove formatting artifacts like asterisks.
|
49 |
+
* **QA & Assistant (800 examples):** From `Darkhn/WOF_QA_V2`. These are instruction-response pairs focused on answering lore questions and following commands within the context of the *Wings of Fire* world.
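The arithmetic behind the fine-tuning mix can be checked with a short sketch. It assumes the "approx. 30%" figure applies to the roleplay subset only (it appears in that bullet), so the resulting count is an estimate rather than a published number. The variable names are illustrative.

```python
# Counts taken from the two dataset bullets above.
ROLEPLAY_EXAMPLES = 4_200
QA_EXAMPLES = 800

# Assumption: "approx. 30% of examples" refers to the roleplay subset.
MATURE_FRACTION = 0.30

total = ROLEPLAY_EXAMPLES + QA_EXAMPLES
roleplay_share = ROLEPLAY_EXAMPLES / total
qa_share = QA_EXAMPLES / total
mature_examples = round(ROLEPLAY_EXAMPLES * MATURE_FRACTION)

print(f"total examples:        {total}")
print(f"roleplay share:        {roleplay_share:.0%}")
print(f"QA/assistant share:    {qa_share:.0%}")
print(f"mature-themed (est.):  {mature_examples}")
```

Under that assumption, the mix is mostly roleplay, with QA pairs making up roughly one sixth of the examples.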
|
50 |
+
|
51 |
+
## Intended Use & Limitations
|
52 |
+
|
53 |
+
* **Intended Use:** This model is intended for creative and roleplaying purposes within the *Wings of Fire* universe. It is designed for fans of the series and is not a general-purpose chatbot.
|
54 |
+
|
55 |
+
* **Limitations & Quirks:**
|
56 |
+
* Performance on tasks outside of its training domain (general knowledge, coding, etc.) is not guaranteed and will likely be poor.
|
57 |
+
* The model may "hallucinate" or generate plausible but non-canonical information.
|
58 |
+
* **Content:** The roleplay training data includes more mature and darker themes from the *Wings of Fire* series, such as character death, conflict, and moral ambiguity. The model is capable of generating content reflecting these themes. It can also generate gratuitous or explicit content; as always, it's up to the user what they do with it.
|
59 |
+
* **Formatting:** The training data was cleaned to remove formatting artifacts such as asterisks (`*...*`) used for single-word emphasis. The model should now produce cleaner, more narrative-style prose than previous versions.
|
60 |
+
* **Safety:** This model has not undergone additional safety alignment beyond what was included in its base Llama 3.3 model. Standard responsible AI practices should be followed.
|
61 |
+
|
62 |
+
## Recommended Sampler Settings
|
63 |
+
|
64 |
+
For optimal performance that balances creativity and coherence, the following default sampler settings are recommended.
|
65 |
+
|
66 |
+
* **Temperature:** 0.8-1.1
|
67 |
+
* **Min_P:** 0.02
|
68 |
+
* **DRY Sampler:**
|
69 |
+
* **Multiplier:** 0.8
|
70 |
+
* **Allowed Length:** 4
|
71 |
+
* **Base:** 1.75
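As a minimal sketch, the settings above can be collected into a request body for a local inference backend. The key names (`min_p`, `dry_multiplier`, `dry_allowed_length`, `dry_base`) are an assumption: several backends use names like these, but check your backend's documentation and rename them as needed. The helper `build_sampler_payload` is hypothetical and only builds a dictionary; it does not call any API.

```python
import json

# Recommended values from the sampler list above.
TEMPERATURE_RANGE = (0.8, 1.1)
MIN_P = 0.02
DRY = {"multiplier": 0.8, "allowed_length": 4, "base": 1.75}


def build_sampler_payload(prompt: str, temperature: float = 0.9) -> dict:
    """Return a request body using the recommended sampler settings.

    The key names are an assumption; rename them to match whatever
    your backend expects.
    """
    low, high = TEMPERATURE_RANGE
    if not low <= temperature <= high:
        raise ValueError(f"temperature {temperature} is outside {low}-{high}")
    return {
        "prompt": prompt,
        "temperature": temperature,
        "min_p": MIN_P,
        "dry_multiplier": DRY["multiplier"],
        "dry_allowed_length": DRY["allowed_length"],
        "dry_base": DRY["base"],
    }


if __name__ == "__main__":
    payload = build_sampler_payload("Tell me about the dragon tribes.")
    print(json.dumps(payload, indent=2))
```

The temperature check simply keeps a chosen value inside the recommended 0.8-1.1 range; the default of 0.9 is an arbitrary pick within it.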
|
72 |
+
|
73 |
+
## Acknowledgements
|
74 |
+
* Credit to Meta for the powerful Llama 3.3 architecture.
|
character card/Wings%20of%20Fire_1.json
ADDED
The diff for this file is too large to render.
|
|
character card/Wings%20of%20Fire_1.png
ADDED