Update README.md
README.md CHANGED
@@ -1,10 +1,18 @@
 ---
-base_model:
+base_model:
+- meta-llama/Meta-Llama-3-8B
+- shisa-ai/shisa-v1-llama3-8b
+- rinna/llama-3-youko-8b
+- tokyotech-llm/Llama-3-Swallow-8B-v0.1
 library_name: transformers
 tags:
 - mergekit
 - merge
-
+license: llama3
+language:
+- ja
+- en
+pipeline_tag: text-generation
 ---
 # merge2

@@ -13,14 +21,14 @@ This is a merge of pre-trained language models created using [mergekit](https://
 ## Merge Details
 ### Merge Method

-This model was merged using the [SCE](https://arxiv.org/abs/2408.07990) merge method using /
+This model was merged using the [SCE](https://arxiv.org/abs/2408.07990) merge method using [meta-llama/Meta-Llama-3-8B](https://huggingface.co/meta-llama/Meta-Llama-3-8B) as a base.

 ### Models Merged

 The following models were included in the merge:
-* /
-* /
-* /
+* [shisa-ai/shisa-v1-llama3-8b](https://huggingface.co/shisa-ai/shisa-v1-llama3-8b)
+* [rinna/llama-3-youko-8b](https://huggingface.co/rinna/llama-3-youko-8b)
+* [tokyotech-llm/Llama-3-Swallow-8B-v0.1](https://huggingface.co/tokyotech-llm/Llama-3-Swallow-8B-v0.1)

 ### Configuration

@@ -30,16 +38,16 @@ The following YAML configuration was used to produce this model:

 models:
   # Pivot model
-  - model:
+  - model: meta-llama/Meta-Llama-3-8B
   # Target models
-  - model: /
-  - model: /
-  - model: /
+  - model: shisa-ai/shisa-v1-llama3-8b
+  - model: rinna/llama-3-youko-8b
+  - model: tokyotech-llm/Llama-3-Swallow-8B-v0.1
 merge_method: sce
-base_model:
+base_model: meta-llama/Meta-Llama-3-8B
 parameters:
   select_topk: 0.65
   int8_mask: true
 dtype: bfloat16

-```
+```
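The `select_topk: 0.65` parameter in the configuration above is specific to the SCE method. The snippet below is a rough, single-tensor illustration of the Select / Calculate / Erase idea from the linked paper (keep the highest-variance delta elements, weight each model by the magnitude of what survives, drop sign-conflicting entries). It is not mergekit's implementation; the function name, normalisation, and other details are assumptions made for the sketch.

```python
# Toy, single-tensor sketch of an SCE-style merge. NOT mergekit's actual code;
# it only illustrates what `select_topk` roughly controls.
import torch

def sce_merge_tensor(base: torch.Tensor,
                     candidates: list[torch.Tensor],
                     select_topk: float = 0.65) -> torch.Tensor:
    # Task vectors: how each candidate model differs from the pivot/base.
    deltas = torch.stack([c - base for c in candidates])  # (n_models, *param_shape)

    # Select: keep only the fraction `select_topk` of elements with the highest
    # variance across candidates, zeroing out the rest.
    variance = deltas.var(dim=0)
    k = max(1, int(select_topk * variance.numel()))
    threshold = variance.flatten().topk(k).values.min()
    deltas = deltas * (variance >= threshold).to(deltas.dtype)

    # Calculate: per-model fusion weights proportional to the squared magnitude
    # of each model's surviving task vector.
    weights = (deltas ** 2).sum(dim=tuple(range(1, deltas.dim())))
    weights = weights / weights.sum().clamp_min(1e-12)

    # Erase: zero out elements whose sign disagrees with the overall direction.
    majority_sign = torch.sign(deltas.sum(dim=0))
    deltas = deltas * (torch.sign(deltas) == majority_sign).to(deltas.dtype)

    # Merge: weighted sum of the surviving task vectors, added back onto the base.
    fused = (weights.view(-1, *([1] * (deltas.dim() - 1))) * deltas).sum(dim=0)
    return base + fused
```

With `select_topk=0.65`, roughly the 65% highest-variance delta entries would survive the selection step before the sign-consensus filter is applied.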
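The card tags the merge for Japanese and English text generation. A minimal usage sketch with Transformers follows; the repository id is a placeholder, since the card only names the output `merge2`, so point it at wherever the merged weights actually live.

```python
# Minimal text-generation sketch for the merged model.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "your-name/merge2"  # placeholder repo id or local path to the merged weights
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # matches the dtype used for the merge
    device_map="auto",
)

prompt = "日本の首都はどこですか？"  # the card lists Japanese and English
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```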