---
datasets:
- KBlueLeaf/danbooru2023-sqlite
language:
- en
library_name: diffusers
---

# Kohaku XL Gamma

An SDXL anime base model that aims to create unique artwork.
<div style="display:flex; flex-direction: row; width: 100%;">
<div><img style="display:block;" src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/afc1dc39-3a1c-4db9-9942-0b2c150124ce/original=true/00946.jpeg"/></div>
<div><img style="display:block;" src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/20751b43-3d91-4802-b5cb-ce3857d1027a/original=true/00937.jpeg"/></div>
<div><img style="display:block;" src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/fade32c7-7cb8-4a69-84f3-d3da0f4b319d/original=true/00943.jpeg"/></div>
</div>

---

## Introduction

This model can be seen as a derivative of the Animagine XL 3.0 project.
I am collaborating with Linaqruf to make a better anime base model (although we obviously have different goals and targets).
We share our models and techniques to improve the quality of each other's models.
That is also how this model was created.


### Base7

Kohaku-XL base7 is resumed from beta7 and uses the same dataset as the beta series. This time, however, I used my own metadata system to create the captions. (It can be seen as an advanced version of what Linaqruf used; I will open-source it soon.)

The metadata database can be downloaded here:
[KBlueLeaf/danbooru2023-sqlite · Datasets at Hugging Face](https://huggingface.co/datasets/KBlueLeaf/danbooru2023-sqlite)

Training details:
* LR: 8e-6/2e-6
* Scheduler: constant with warmup
* Batch size: 128 (batch size 4 * grad acc 16 * gpu count 2)
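
As a quick check of the batch size arithmetic above, the effective batch size is the product of the three factors. The variable names below are only illustrative and are not taken from the actual training config:

```python
# Values come from the training details above; the names are hypothetical.
per_device_batch_size = 4
gradient_accumulation_steps = 16
gpu_count = 2

# One optimizer step sees this many samples in total.
effective_batch_size = (
    per_device_batch_size * gradient_accumulation_steps * gpu_count
)
print(effective_batch_size)  # 128
```
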


### Gamma rev1

Kohaku-XL Gamma rev1 is a merged model that combines the learned diffs from anxl3 and Kohaku-XL base7, using this formula:

gamma rev1 = beta7 + 0.8 * (anxl3 - anxl2) + 0.5 * (base7 - beta7)
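
The formula above can be sketched in Python as a key-by-key merge. This is only a minimal illustration, not the actual merge script: the function name `merge_gamma_rev1` and its parameters are hypothetical, and it assumes all four checkpoints share the same parameter names and shapes. Plain floats stand in for the weights here; the same arithmetic should work on tensors from a loaded state dict, since they support `+`, `-` and `*`.

```python
def merge_gamma_rev1(beta7, anxl2, anxl3, base7, anxl_scale=0.8, base_scale=0.5):
    """Apply beta7 + 0.8 * (anxl3 - anxl2) + 0.5 * (base7 - beta7) per key.

    Each argument maps a parameter name to its weight. All four mappings
    are assumed to contain the same keys (an assumption of this sketch).
    """
    merged = {}
    for name, weight in beta7.items():
        anxl_diff = anxl3[name] - anxl2[name]   # what anxl3 learned on top of anxl2
        base_diff = base7[name] - weight        # what base7 learned on top of beta7
        merged[name] = weight + anxl_scale * anxl_diff + base_scale * base_diff
    return merged


# Toy single-value "checkpoints", purely for illustration.
beta7 = {"w": 1.0}
anxl2 = {"w": 2.0}
anxl3 = {"w": 3.0}
base7 = {"w": 2.0}

merged = merge_gamma_rev1(beta7, anxl2, anxl3, base7)
print(round(merged["w"], 6))  # 2.3  (1.0 + 0.8 * 1.0 + 0.5 * 1.0)
```

Note that the base7 term is scaled by 0.5, so only half of the base7 training diff is applied on top of beta7, while 80% of the anxl2-to-anxl3 diff is added.
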

---

## Usage

This model uses my own system for quality tags and similar tags.
Although this model merges in the diff weights from anxl3, I still recommend using my tagging system (or both).

The prompt format is the same as anxl3's. (You can check the sample images I posted.)

Rating tags:
* General: safe
* Sensitive: sensitive
* Questionable: nsfw
* Explicit: explicit, nsfw

Quality tags (best to worst):
* Masterpiece
* best quality
* great quality
* good quality
* normal quality
* low quality
* worst quality


Year tags (New to Old):
* newest
* recent
* mid
* early
* old
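
The tag lists above can be combined into a prompt with a small helper like the one below. This is a hedged sketch: `build_prompt` is a hypothetical helper, the content tags are illustrative, writing `masterpiece` in lowercase is an assumption, and so is appending the rating, quality, and year tags after the content tags. Check the sample images for the tag order that is actually used.

```python
# Tag vocabularies copied from the lists above (rating name -> prompt tags).
RATING_TAGS = {
    "general": ["safe"],
    "sensitive": ["sensitive"],
    "questionable": ["nsfw"],
    "explicit": ["explicit", "nsfw"],
}
QUALITY_TAGS = [
    "masterpiece", "best quality", "great quality", "good quality",
    "normal quality", "low quality", "worst quality",
]
YEAR_TAGS = ["newest", "recent", "mid", "early", "old"]


def build_prompt(content_tags, rating="general", quality="masterpiece", year="newest"):
    """Join content tags with one rating, one quality and one year tag."""
    if rating not in RATING_TAGS:
        raise ValueError(f"unknown rating: {rating!r}")
    if quality not in QUALITY_TAGS:
        raise ValueError(f"unknown quality tag: {quality!r}")
    if year not in YEAR_TAGS:
        raise ValueError(f"unknown year tag: {year!r}")
    # Tag placement is an assumption of this sketch, not a documented rule.
    tags = list(content_tags) + RATING_TAGS[rating] + [quality, year]
    return ", ".join(tags)


prompt = build_prompt(["1girl", "solo", "looking at viewer"])
print(prompt)  # 1girl, solo, looking at viewer, safe, masterpiece, newest
```

The worse quality tags and older year tags are probably most useful in a negative prompt, but that is a guess and not something stated here.
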

---
## Future plan

My dataset has some resize/WebP artifacts that will harm the models, so I will recreate it with my new system (and open-source it once it is done).

The next plan is to train a model on a larger (3M~6M) dataset with a better configuration. This will require A100s, and I plan to spend about 2000~10000 USD on it. If you like my work, consider sponsoring me via buy-me-a-coffee or BTC.