Update README.md
README.md CHANGED
````diff
@@ -11,14 +11,21 @@ tags:
 base_model:
 - mlabonne/AlphaMonarch-7B
 - Syed-Hasan-8503/Tess-Coder-7B-Mistral-v1.0
+language:
+- en
+library_name: transformers
 ---
 
 # MonarchCoder-MoE-2x7B
 
+![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/64e380b2e12618b261fa6ba0/eoHRSEuT-_TtlrPX7PrOW.jpeg)
+
 MonarchCoder-MoE-2x7B is a Mixture of Experts (MoE) made with the following models using [LazyMergekit](https://colab.research.google.com/drive/1obulZ1ROXHjYLn6PPZJwRR6GzgQogxxb?usp=sharing):
 * [mlabonne/AlphaMonarch-7B](https://huggingface.co/mlabonne/AlphaMonarch-7B)
 * [Syed-Hasan-8503/Tess-Coder-7B-Mistral-v1.0](https://huggingface.co/Syed-Hasan-8503/Tess-Coder-7B-Mistral-v1.0)
 
+The main aim behind this model is to have a single model that performs well at reasoning, conversation, and coding. AlphaMonarch performs amazingly well on reasoning and conversation tasks. Merging AlphaMonarch with a coding model yielded MonarchCoder-2x7B, which performs better on the OpenLLM, Nous, and HumanEval benchmarks.
+
 ## 🧩 Configuration
 
 ```yaml
````
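
Since the updated front matter declares `library_name: transformers`, the merged checkpoint should load like any other Mistral-style causal LM. The snippet below is a minimal usage sketch, not part of the card or this diff: the Hub repo id is a placeholder, and the chat-template call assumes the tokenizer ships one.

```python
# Minimal sketch (assumptions: placeholder repo id, tokenizer provides a chat template,
# accelerate installed for device_map="auto").
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "<hub-username>/MonarchCoder-MoE-2x7B"  # placeholder, not the confirmed repo path

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="auto",   # keep the checkpoint's native precision
    device_map="auto",    # spread the MoE weights across available devices
)

# Format a single-turn prompt with the tokenizer's chat template.
messages = [{"role": "user", "content": "Write a Python function that reverses a string."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=256, do_sample=True, temperature=0.7)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```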