Large-scale finetune of Illustrious with state-of-the-art techniques and performance.
The dataset contains 4.5M pictures (0.8M with natural-text captions), picked and balanced from 12M pieces of anime art and other media, including private datasets. A more detailed description is available on Civitai.
Key advantages:
- Better prompt following
- Great aesthetics, anatomy, and stability, along with versatility
- Vibrant colors and smooth gradients without a trace of burning
- Full brightness range even with epsilon
- Knowledge of tens of thousands of styles and almost any character
In addition, compared with vanilla Illustrious and NoobAI:
- No more annoying watermarks
- No tag bleed and better prompt segmentation
- No character tag bleed and related side effects (unwanted outfit, style, or composition changes)
- Better coherence and anatomy
- Artist styles look exactly as they should
- Each style, including the base one, is stable without random fluctuations across seeds
- New knowledge
Features and prompting:
The model is designed to work with both short booru tag-based prompts and long, complex natural-text prompts. The best results are achieved by combining tags with some natural-text phrases. Tags follow the classic danbooru style: comma-separated, without underscores.
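A combined prompt of this kind can be assembled with a small helper. This is only a minimal sketch, not part of the model's tooling: the function name `build_prompt` and the example tags are hypothetical, and the blanket underscore replacement is a simplification (it would also alter emoticon-style tags such as `^_^`).

```python
def build_prompt(tags, natural_text=""):
    """Join booru tags (underscores replaced by spaces) with optional natural text."""
    # Normalize each tag to the comma-separated, underscore-free form.
    cleaned = [t.strip().replace("_", " ") for t in tags if t.strip()]
    parts = [", ".join(cleaned)]
    # Natural text, if any, is appended after the tags.
    if natural_text.strip():
        parts.append(natural_text.strip())
    return ", ".join(p for p in parts if p)


if __name__ == "__main__":
    print(build_prompt(
        ["1girl", "long_hair", "school_uniform"],
        "She is standing on a bridge at sunset.",
    ))
```

Putting the tags first and the natural text last is a choice made for this sketch; the card does not prescribe an order between them.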
Basic settings:
- Resolution: ~1 megapixel for txt2img, any aspect ratio with dimensions that are multiples of 64 (1024x1024, 1152x, 1216x832, ...).
- Sampler: Euler_a. cfg++ samplers work fine; LCM/PCM are untested.
- CFG: 4..8 for epsilon, 3..5 for vpred.
- Steps: 20..28.
- Highres fix: x1.5 latent upscale + denoise 0.6, or any GAN upscaler + denoise 0.3..0.55.
Please note that the vpred version requires a lower CFG value.
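The resolution rule above (about 1 megapixel, both sides a multiple of 64) can be turned into a quick calculation. The sketch below is an illustration under assumptions: `pick_resolution` is a hypothetical helper, and treating "~1 megapixel" as 1024 x 1024 pixels is this example's choice, not something the card states.

```python
import math


def pick_resolution(aspect_ratio, target_pixels=1024 * 1024, multiple=64):
    """Return (width, height) close to target_pixels, each a multiple of `multiple`.

    aspect_ratio is width / height.
    """
    # Ideal side lengths for the requested area and aspect ratio,
    # snapped to the nearest multiple of 64.
    width = round(math.sqrt(target_pixels * aspect_ratio) / multiple) * multiple
    height = round(math.sqrt(target_pixels / aspect_ratio) / multiple) * multiple
    return width, height


if __name__ == "__main__":
    for ar in (1.0, 1216 / 832, 832 / 1216, 16 / 9):
        print(ar, pick_resolution(ar))
```

Because each side is rounded independently, the resulting pixel count can drift a few percent from the target, which fits the "~1 megapixel" wording.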
Examples can be found in the image folder in the repo.
Quality tags:
There are only 4:
masterpiece, best quality
for positive and
low quality, worst quality
for negative
Nothing else. Meta tags like lowres have been removed; do not use them. Low-resolution images have been either removed or upscaled and cleaned with DAT, depending on their importance.
Negative prompt:
worst quality, low quality, watermark
For best results, keep it as clean as possible. Spamming popular tag sequences will not improve results, since all the related flaws have already been solved; it will only lead to unwanted effects, biases and poor quality.
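The quality-tag and negative-prompt rules above can be captured in a few constants. This is a hedged sketch: `finalize_prompt` is a hypothetical helper, placing the quality tags at the front is an assumption of this example, and `REMOVED_META_TAGS` contains only `lowres` because that is the only removed tag the card names.

```python
QUALITY_POSITIVE = "masterpiece, best quality"
NEGATIVE_PROMPT = "worst quality, low quality, watermark"
# Only "lowres" is named explicitly in the card; other removed tags are unknown.
REMOVED_META_TAGS = {"lowres"}


def finalize_prompt(prompt):
    """Prepend the positive quality tags and drop meta tags the model no longer uses."""
    pieces = [p.strip() for p in prompt.split(",") if p.strip()]
    kept = [p for p in pieces if p.lower() not in REMOVED_META_TAGS]
    return ", ".join([QUALITY_POSITIVE] + kept)


if __name__ == "__main__":
    print("positive:", finalize_prompt("1girl, lowres, smile"))
    print("negative:", NEGATIVE_PROMPT)
```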
Artist styles:
The model knows over 22k artist styles. The list and grids with examples are on Mega. Styles are used with the "by " prefix and will not work properly without it.
0.6.1 vpred also has the following styles:
by nyalia, by flooxyfloox, by koni, by truck-kun, by 748cm, by galawave, by aruhshura, by kyomu, by youlichu, by alens, by chlenix, by cleandongye, by fltccktl, by merratatustle, by xi410, by youmuanon, by memento mori
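Since styles need the "by " prefix, a prompt-building script can enforce it. The helper below is a minimal sketch; `style_tag` is a hypothetical name, and the artist names are taken from the list above.

```python
def style_tag(artist):
    """Return the artist name with the required "by " prefix, adding it if missing."""
    name = artist.strip()
    if name.lower().startswith("by "):
        return name
    return f"by {name}"


if __name__ == "__main__":
    print(", ".join(style_tag(a) for a in ["nyalia", "by koni", "748cm"]))
```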
General styles:
2.5d, anime screencap, bold line, sketch, cgi, digital painting, flat colors, smooth shading, minimalistic, ink style, oil style, pastel style
Natural text:
Use it in combination with booru tags (this works great), use only natural text after the style and quality tags, or use just booru tags and forget about it; it's all up to you. The dataset contains over 800k pictures with hybrid natural-text captions made by Opus-Vision, GPT-4o and ToriiGate.
Brightness/colors/contrast:
You can use extra meta tags to control it:
low brightness, high brightness, low gamma, high gamma, sharp colors, soft colors, hdr, sdr, limited range
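As an illustration, these meta tags can be validated before being appended to a prompt, so that a typo does not silently become an unknown tag. This is a sketch only: `add_tone_tags` is a hypothetical helper, and appending the tags at the end of the prompt is an assumption of this example.

```python
# The meta tags listed above, copied verbatim.
TONE_TAGS = {
    "low brightness", "high brightness", "low gamma", "high gamma",
    "sharp colors", "soft colors", "hdr", "sdr", "limited range",
}


def add_tone_tags(prompt, tone_tags):
    """Append brightness/color meta tags, rejecting anything not in the list."""
    unknown = [t for t in tone_tags if t not in TONE_TAGS]
    if unknown:
        raise ValueError(f"not a listed meta tag: {unknown}")
    return ", ".join([prompt, *tone_tags]) if tone_tags else prompt


if __name__ == "__main__":
    print(add_tone_tags("1girl, night, city lights", ["low brightness", "hdr"]))
```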
Vpred version:
The vpred version has index 0.6.1 because it was retrained from the base to fix observed flaws; now it works flawlessly. To use it you need the latest dev build of a1111, comfy, or reforge. Do not forget to lower your CFG to 3..5; higher values will lead to over-saturation.
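A simple sanity check can catch an epsilon-style CFG being reused with the vpred checkpoint. The sketch below encodes the CFG and step ranges from the basic settings; `check_settings` and the `"epsilon"` / `"vpred"` keys are hypothetical names chosen for this example and do not correspond to any UI's API.

```python
# Recommended ranges from the card: (min, max), inclusive.
RECOMMENDED = {
    "epsilon": {"cfg": (4.0, 8.0), "steps": (20, 28)},
    "vpred": {"cfg": (3.0, 5.0), "steps": (20, 28)},
}


def check_settings(prediction_type, cfg, steps):
    """Return a list of warnings for values outside the recommended ranges."""
    ranges = RECOMMENDED[prediction_type]
    warnings = []
    low, high = ranges["cfg"]
    if not low <= cfg <= high:
        warnings.append(f"CFG {cfg} is outside {low}..{high} for {prediction_type}")
    low, high = ranges["steps"]
    if not low <= steps <= high:
        warnings.append(f"{steps} steps is outside {low}..{high}")
    return warnings


if __name__ == "__main__":
    # CFG 7 is fine for epsilon but too high for vpred.
    print(check_settings("epsilon", 7, 24))
    print(check_settings("vpred", 7, 24))
```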
Discord server
Safety:
The model tends to generate NSFW images for corresponding prompts; consider adding extra filtering. Outputs may be inaccurate and provocative and must not be used as a reference.
License:
Same as Illustrious; please check the original page for limitations. Feel free to use it in your merges, finetunes, etc.; just please leave a link.
Donations:
BTC bc1qwv83ggq8rvv07uk6dv4njs0j3yygj3aax4wg6c
ETH/USDT(e) 0x04C8a749F49aE8a56CB84cF0C99CD9E92eDB17db
Model tree for Minthy/RouWei-0.6
Base model
KBlueLeaf/kohaku-xl-beta5