File size: 8,820 Bytes
52b4e2b 3f0ae33 52b4e2b dc0d092 52b4e2b |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 |
---
tags:
- not-for-all-audiences
- merge
- mergekit
base_model:
- openlynn/Llama-3-Soliloquy-8B-v2
- Undi95/Llama-3-LewdPlay-8B-evo
- NeverSleep/Llama-3-Lumimaid-8B-v0.1
- NeverSleep/Llama-3-Lumimaid-8B-v0.1-OAS
- dreamgen-preview/opus-v1.2-llama-3-8b-base-run3.4-epoch2
- dreamgen-preview/opus-v1.2-llama-3-8b-instruct-run3.5-epoch2.5
- Sao10K/L3-8B-Stheno-v3.2
- mpasila/Llama-3-LiPPA-8B
- mpasila/Llama-3-Instruct-LiPPA-8B
- Abdulhanan2006/WaifuAI-L3-8B-8k
- Blackroot/Llama-3-8B-Abomination-LORA
- abacusai/Llama-3-Smaug-8B
- jondurbin/bagel-8b-v1.0
- TIGER-Lab/MAmmoTH2-8B-Plus
- VAGOsolutions/Llama-3-SauerkrautLM-8b-Instruct
- failspy/Meta-Llama-3-8B-Instruct-abliterated-v3
- Undi95/Llama-3-Unholy-8B
- Undi95/Llama3-Unholy-8B-OAS
- Undi95/Unholy-8B-DPO-OAS
- Edgerunners/meta-llama-3-8b-instruct-hf-ortho-baukit-5fail-3000total-bf16
- vicgalle/Configurable-Llama-3-8B-v0.3
- lodrick-the-lafted/Limon-8B
- AwanLLM/Awanllm-Llama-3-8B-Cumulus-v1.0
- WhiteRabbitNeo/Llama-3-WhiteRabbitNeo-8B-v2.0
- migtissera/Tess-2.0-Llama-3-8B
- HPAI-BSC/Llama3-Aloe-8B-Alpha
- refuelai/Llama-3-Refueled
- Danielbrdz/Barcenas-Llama3-8b-ORPO
- lodrick-the-lafted/Olethros-8B
- migtissera/Llama-3-8B-Synthia-v3.5
- RLHFlow/LLaMA3-iterative-DPO-final
- chujiezheng/LLaMA3-iterative-DPO-final-ExPO
- princeton-nlp/Llama-3-Instruct-8B-SimPO
- chujiezheng/Llama-3-Instruct-8B-SimPO-ExPO
license: llama3
language:
- en
---
# Merging Compute Sponsored by KoboldAI
![Model Tree](https://huggingface.co/PJMixers/LLaMa-3-CursedStock-v1.8-8B/resolve/main/model_tree.png)
---
Refer to the original models for best usage.
- [openlynn/Llama-3-Soliloquy-8B-v2](https://huggingface.co/openlynn/Llama-3-Soliloquy-8B-v2)
- [Undi95/Llama-3-LewdPlay-8B-evo](https://huggingface.co/Undi95/Llama-3-LewdPlay-8B-evo)
- [NeverSleep/Llama-3-Lumimaid-8B-v0.1](https://huggingface.co/NeverSleep/Llama-3-Lumimaid-8B-v0.1)
- [NeverSleep/Llama-3-Lumimaid-8B-v0.1-OAS](https://huggingface.co/NeverSleep/Llama-3-Lumimaid-8B-v0.1-OAS)
- [dreamgen-preview/opus-v1.2-llama-3-8b-base-run3.4-epoch2](https://huggingface.co/dreamgen-preview/opus-v1.2-llama-3-8b-base-run3.4-epoch2)
- [dreamgen-preview/opus-v1.2-llama-3-8b-instruct-run3.5-epoch2.5](https://huggingface.co/dreamgen-preview/opus-v1.2-llama-3-8b-instruct-run3.5-epoch2.5)
- [Sao10K/L3-8B-Stheno-v3.2](https://huggingface.co/Sao10K/L3-8B-Stheno-v3.2)
- [mpasila/Llama-3-LiPPA-8B](https://huggingface.co/mpasila/Llama-3-LiPPA-8B)
- [mpasila/Llama-3-Instruct-LiPPA-8B](https://huggingface.co/mpasila/Llama-3-Instruct-LiPPA-8B)
- [Abdulhanan2006/WaifuAI-L3-8B-8k](https://huggingface.co/Abdulhanan2006/WaifuAI-L3-8B-8k)
- [Blackroot/Llama-3-8B-Abomination-LORA](https://huggingface.co/Blackroot/Llama-3-8B-Abomination-LORA)
- [abacusai/Llama-3-Smaug-8B](https://huggingface.co/abacusai/Llama-3-Smaug-8B)
- [jondurbin/bagel-8b-v1.0](https://huggingface.co/jondurbin/bagel-8b-v1.0)
- [TIGER-Lab/MAmmoTH2-8B-Plus](https://huggingface.co/TIGER-Lab/MAmmoTH2-8B-Plus)
- [VAGOsolutions/Llama-3-SauerkrautLM-8b-Instruct](https://huggingface.co/VAGOsolutions/Llama-3-SauerkrautLM-8b-Instruct)
- [failspy/Meta-Llama-3-8B-Instruct-abliterated-v3](https://huggingface.co/failspy/Meta-Llama-3-8B-Instruct-abliterated-v3)
- [Undi95/Llama-3-Unholy-8B](https://huggingface.co/Undi95/Llama-3-Unholy-8B)
- [Undi95/Llama3-Unholy-8B-OAS](https://huggingface.co/Undi95/Llama3-Unholy-8B-OAS)
- [Undi95/Unholy-8B-DPO-OAS](https://huggingface.co/Undi95/Unholy-8B-DPO-OAS)
- [Edgerunners/meta-llama-3-8b-instruct-hf-ortho-baukit-5fail-3000total-bf16](https://huggingface.co/Edgerunners/meta-llama-3-8b-instruct-hf-ortho-baukit-5fail-3000total-bf16)
- [vicgalle/Configurable-Llama-3-8B-v0.3](https://huggingface.co/vicgalle/Configurable-Llama-3-8B-v0.3)
- [lodrick-the-lafted/Limon-8B](https://huggingface.co/lodrick-the-lafted/Limon-8B)
- [AwanLLM/Awanllm-Llama-3-8B-Cumulus-v1.0](https://huggingface.co/AwanLLM/Awanllm-Llama-3-8B-Cumulus-v1.0)
- [WhiteRabbitNeo/Llama-3-WhiteRabbitNeo-8B-v2.0](https://huggingface.co/WhiteRabbitNeo/Llama-3-WhiteRabbitNeo-8B-v2.0)
- [migtissera/Tess-2.0-Llama-3-8B](https://huggingface.co/migtissera/Tess-2.0-Llama-3-8B)
- [HPAI-BSC/Llama3-Aloe-8B-Alpha](https://huggingface.co/HPAI-BSC/Llama3-Aloe-8B-Alpha)
- [refuelai/Llama-3-Refueled](https://huggingface.co/refuelai/Llama-3-Refueled)
- [Danielbrdz/Barcenas-Llama3-8b-ORPO](https://huggingface.co/Danielbrdz/Barcenas-Llama3-8b-ORPO)
- [lodrick-the-lafted/Olethros-8B](https://huggingface.co/lodrick-the-lafted/Olethros-8B)
- [migtissera/Llama-3-8B-Synthia-v3.5](https://huggingface.co/migtissera/Llama-3-8B-Synthia-v3.5)
- [RLHFlow/LLaMA3-iterative-DPO-final](https://huggingface.co/RLHFlow/LLaMA3-iterative-DPO-final)
- [chujiezheng/LLaMA3-iterative-DPO-final-ExPO](https://huggingface.co/chujiezheng/LLaMA3-iterative-DPO-final-ExPO)
- [princeton-nlp/Llama-3-Instruct-8B-SimPO](https://huggingface.co/princeton-nlp/Llama-3-Instruct-8B-SimPO)
- [chujiezheng/Llama-3-Instruct-8B-SimPO-ExPO](https://huggingface.co/chujiezheng/Llama-3-Instruct-8B-SimPO-ExPO)
---
# Mergekit Recipe
```yaml
# Includes Prompt Format Types
merge_method: model_stock
base_model: NousResearch/Meta-Llama-3-8B
dtype: bfloat16
models:
# RP
- model: openlynn/Llama-3-Soliloquy-8B-v2 # LLaMa-3-Instruct
- model: Undi95/Llama-3-LewdPlay-8B-evo # LLaMa-3-Instruct
- model: NeverSleep/Llama-3-Lumimaid-8B-v0.1 # LLaMa-3-Instruct
- model: NeverSleep/Llama-3-Lumimaid-8B-v0.1-OAS # LLaMa-3-Instruct
- model: dreamgen-preview/opus-v1.2-llama-3-8b-base-run3.4-epoch2 # Possibly LLaMa-3-Instruct?
- model: dreamgen-preview/opus-v1.2-llama-3-8b-instruct-run3.5-epoch2.5 # LLaMa-3-Instruct
- model: Sao10K/L3-8B-Stheno-v3.2 # LLaMa-3-Instruct
- model: mpasila/Llama-3-LiPPA-8B # LLaMa-3-Instruct (Unsloth changed assistant to gpt and user to human.)
- model: mpasila/Llama-3-Instruct-LiPPA-8B # LLaMa-3-Instruct (Unsloth changed assistant to gpt and user to human.)
- model: Abdulhanan2006/WaifuAI-L3-8B-8k # Possibly LLaMa-3-Instruct?
- model: NousResearch/Meta-Llama-3-8B-Instruct+Blackroot/Llama-3-8B-Abomination-LORA # LLaMa-3-Instruct
# Smart
- model: abacusai/Llama-3-Smaug-8B # Possibly LLaMa-3-Instruct?
- model: jondurbin/bagel-8b-v1.0 # LLaMa-3-Instruct
- model: TIGER-Lab/MAmmoTH2-8B-Plus # Possibly LLaMa-3-Instruct?
- model: VAGOsolutions/Llama-3-SauerkrautLM-8b-Instruct # LLaMa-3-Instruct
# Uncensored
- model: failspy/Meta-Llama-3-8B-Instruct-abliterated-v3 # LLaMa-3-Instruct
- model: Undi95/Llama-3-Unholy-8B # LLaMa-3-Instruct
- model: Undi95/Llama3-Unholy-8B-OAS # LLaMa-3-Instruct
- model: Undi95/Unholy-8B-DPO-OAS # LLaMa-3-Instruct
- model: Edgerunners/meta-llama-3-8b-instruct-hf-ortho-baukit-5fail-3000total-bf16 # Possibly LLaMa-3-Instruct?
- model: vicgalle/Configurable-Llama-3-8B-v0.3 # LLaMa-3-Instruct
- model: lodrick-the-lafted/Limon-8B # LLaMa-3-Instruct
- model: AwanLLM/Awanllm-Llama-3-8B-Cumulus-v1.0 # LLaMa-3-Instruct
# Code
- model: WhiteRabbitNeo/Llama-3-WhiteRabbitNeo-8B-v2.0 # LLaMa-3-Instruct
- model: migtissera/Tess-2.0-Llama-3-8B # LLaMa-3-Instruct
# Med
- model: HPAI-BSC/Llama3-Aloe-8B-Alpha # LLaMa-3-Instruct
# Misc
- model: refuelai/Llama-3-Refueled # LLaMa-3-Instruct
- model: Danielbrdz/Barcenas-Llama3-8b-ORPO # LLaMa-3-Instruct
- model: lodrick-the-lafted/Olethros-8B # LLaMa-3-Instruct
- model: migtissera/Llama-3-8B-Synthia-v3.5 # LLaMa-3-Instruct
- model: RLHFlow/LLaMA3-iterative-DPO-final # LLaMa-3-Instruct
- model: chujiezheng/LLaMA3-iterative-DPO-final-ExPO # LLaMa-3-Instruct
- model: princeton-nlp/Llama-3-Instruct-8B-SimPO # LLaMa-3-Instruct
- model: chujiezheng/Llama-3-Instruct-8B-SimPO-ExPO # LLaMa-3-Instruct
# v1.8
# - Only LLaMa-3-Instruct template models from now on. Not even gonna bother with the jank lol.
# - Add princeton-nlp/Llama-3-Instruct-8B-SimPO.
# - Add chujiezheng/LLaMA3-iterative-DPO-final-ExPO.
# - Add chujiezheng/Llama-3-Instruct-8B-SimPO-ExPO.
# - Add AwanLLM/Awanllm-Llama-3-8B-Cumulus-v1.0 as it's trained on failspy/Meta-Llama-3-8B-Instruct-abliterated-v3 with 8K length using rank 64 QLoRA and claims to be good at RP and storywriting.
# - Add Blackroot/Llama-3-8B-Abomination-LORA as it claims to be heavily trained for RP and storywriting.
# - Replaced Sao10K/L3-8B-Stheno-v3.1 with Sao10K/L3-8B-Stheno-v3.2.
# - Removed victunes/TherapyLlama-8B-v1 as it might be too specific for a general merge. It was also aparently vicuna format.
# - Removed ResplendentAI QLoRA models as they were trained on the base, but don't seem to train lm_head or embed_tokens.
# - Removed BeaverAI/Llama-3SOME-8B-v2-rc2 as newer versions are out, and idk which is best yet. Also don't want doubledipping if I decide to Beavertrain this.
``` |