mergekit-community
/

mergekit-dare_ties-lmociuf

Text Generation

text-generation-inference

Model card Files Files and versions Community

mergekit-dare_ties-lmociuf / README.md

mergekit-uploader's picture

mergekit-uploader

Upload folder using huggingface_hub

5107cd9 verified 3 months ago

|

2.39 kB

	---
	base_model:
	- ReadyArt/Forgotten-Safeword-24B-3.6
	- huihui-ai/Mistral-Small-24B-Instruct-2501-abliterated
	- mistralai/Mistral-Small-24B-Base-2501
	- TheDrummer/Cydonia-24B-v2.1
	- PocketDoc/Dans-PersonalityEngine-V1.2.0-24b
	library_name: transformers
	tags:
	- mergekit
	- merge

	---
	# merge

	This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

	## Merge Details
	### Merge Method

	This model was merged using the [DARE TIES](https://arxiv.org/abs/2311.03099) merge method using [mistralai/Mistral-Small-24B-Base-2501](https://huggingface.co/mistralai/Mistral-Small-24B-Base-2501) as a base.

	### Models Merged

	The following models were included in the merge:
	* [ReadyArt/Forgotten-Safeword-24B-3.6](https://huggingface.co/ReadyArt/Forgotten-Safeword-24B-3.6)
	* [huihui-ai/Mistral-Small-24B-Instruct-2501-abliterated](https://huggingface.co/huihui-ai/Mistral-Small-24B-Instruct-2501-abliterated)
	* [TheDrummer/Cydonia-24B-v2.1](https://huggingface.co/TheDrummer/Cydonia-24B-v2.1)
	* [PocketDoc/Dans-PersonalityEngine-V1.2.0-24b](https://huggingface.co/PocketDoc/Dans-PersonalityEngine-V1.2.0-24b)

	### Configuration

	The following YAML configuration was used to produce this model:

	```yaml
	models:
	- model: mistralai/Mistral-Small-24B-Base-2501
	# No parameters necessary for the base model
	- model: huihui-ai/Mistral-Small-24B-Instruct-2501-abliterated
	parameters:
	density: 0.5 # Retaining 50% of this model's parameters
	weight: 0.1 # Lower influence
	- model: TheDrummer/Cydonia-24B-v2.1 # Highest influence
	parameters:
	density: 0.9 # Retaining 90% of this model's parameters
	weight: 0.4 # Highest influence
	- model: PocketDoc/Dans-PersonalityEngine-V1.2.0-24b # Second highest influence
	parameters:
	density: 0.7 # Retaining 70% of this model's parameters
	weight: 0.3 # Second highest influence
	- model: ReadyArt/Forgotten-Safeword-24B-3.6
	parameters:
	density: 0.6 # Retaining 60% of this model's parameters
	weight: 0.2 # Moderate influence

	merge_method: dare_ties
	base_model: mistralai/Mistral-Small-24B-Base-2501
	parameters:
	normalize: true # Normalizes parameter scaling for consistency
	int8_mask: true # Optimizes for memory-efficient int8 operations
	dtype: bfloat16 # Maintains computations in bfloat16 for performance efficiency
	```