--- base_model: - shisa-ai/shisa-v2-mistral-nemo-12b - inflatebot/MN-12B-Mag-Mell-R1 - yamatazen/Himeyuri-Magnum-12B - Elizezen/Himeyuri-v0.1-12B library_name: transformers tags: - mergekit - merge - chatml language: - en - ja --- ![image/png](https://huggingface.co/yamatazen/Twilight-SCE-12B/resolve/main/Twilight-SCE-12B.png?download=true) This is a Mistral model with ChatML tokens added to the tokenizer. # merge This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit). ## Merge Details ### Merge Method This model was merged using the [SCE](https://arxiv.org/abs/2408.07990) merge method using [shisa-ai/shisa-v2-mistral-nemo-12b](https://huggingface.co/shisa-ai/shisa-v2-mistral-nemo-12b) as a base. ### Models Merged The following models were included in the merge: * [inflatebot/MN-12B-Mag-Mell-R1](https://huggingface.co/inflatebot/MN-12B-Mag-Mell-R1) * [yamatazen/Himeyuri-Magnum-12B](https://huggingface.co/yamatazen/Himeyuri-Magnum-12B) * [Elizezen/Himeyuri-v0.1-12B](https://huggingface.co/Elizezen/Himeyuri-v0.1-12B) ### Configuration The following YAML configuration was used to produce this model: ```yaml base_model: shisa-ai/shisa-v2-mistral-nemo-12b models: - model: Elizezen/Himeyuri-v0.1-12B parameters: weight: 1.0 - model: yamatazen/Himeyuri-Magnum-12B parameters: weight: 0.6 - model: inflatebot/MN-12B-Mag-Mell-R1 parameters: weight: 0.3 merge_method: sce dtype: bfloat16 parameters: normalize: true select_topk: 0.5 tokenizer: source: union ```