--- base_model: [] library_name: transformers tags: - mergekit - merge --- # merged-fineweb-edu-250203 This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit). ## Merge Details ### Merge Method This model was merged using the [TIES](https://arxiv.org/abs/2306.01708) merge method using ../models/Llama-3.1-8B as a base. ### Models Merged The following models were included in the merge: * ../models/fineweb-edu-250203 * ../models/Llama-3.1-8B-Instruct ### Configuration The following YAML configuration was used to produce this model: ```yaml models: - model: ../models/fineweb-edu-250203 parameters: density: 1 # density gradient weight: 1 - model: ../models/Llama-3.1-8B-Instruct parameters: density: 1 weight: 1 # weight gradient merge_method: ties base_model: ../models/Llama-3.1-8B parameters: weight: 1 density: 1 normalize: true ```