TheBloke/manticore-13b-chat-pyg-GGML


TheBloke committed on May 28, 2023

Commit

02dd938

1 Parent(s): 1b2f7dc

Updating model files

README.md +28 -6


@@ -17,6 +17,17 @@ language:
 library_name: transformers
 pipeline_tag: text-generation
 ---
 # Manticore 13B Chat GGML
@@ -62,23 +73,34 @@ GGML models can be loaded into text-generation-webui by installing the llama.cpp
 Further instructions here: [text-generation-webui/docs/llama.cpp-models.md](https://github.com/oobabooga/text-generation-webui/blob/main/docs/llama.cpp-models.md).
 # Original model card - Manticore 13B Chat
-Manticore 13B Chat builds on Manticore with new datasets, including a de-duped subset of the Pygmalion dataset. It also removes all Alpaca style prompts using `###` in favor of
 chat only style prompts using `USER:`,`ASSISTANT:` as well as [pygmalion/metharme prompting](https://huggingface.co/PygmalionAI/metharme-7b#prompting) using `<|system|>, <|user|> and <|model|>` tokens.
 Questions, comments, feedback, looking to donate, or want to help? Reach out on our [Discord](https://discord.gg/EqrvvehG) or email [[email protected]](mailto:[email protected])
 # Training Datasets
-Manticore 13B Chat is a Llama 13B model fine-tuned on the following datasets along with the datasets from the original Manticore 13B.
 **Manticore 13B Chat was trained on 25% of the datasets below. The datasets were merged, shuffled, and then sharded into 4 parts.**
 - de-duped pygmalion dataset, filtered down to RP data
-- [riddle_sense](https://huggingface.co/datasets/riddle_sense) - instruct augmented
 - hellaswag, updated for detailed explanations w 30K+ rows
-- [gsm8k](https://huggingface.co/datasets/gsm8k) - instruct augmented
 - [ewof/code-alpaca-instruct-unfiltered](https://huggingface.co/datasets/ewof/code-alpaca-instruct-unfiltered)
 Manticore 13B
@@ -110,8 +132,8 @@ Try out the model in HF Spaces. The demo uses a quantized GGML version of the mo
 ## Build
-Manticore was built with [Axolotl](https://github.com/OpenAccess-AI-Collective/axolotl) on 8xA100 80GB
- - 3 epochs taking approximately 8 hours. No further epochs will be released.
  - The configuration to duplicate this build is provided in this repo's [/config folder](https://huggingface.co/openaccess-ai-collective/manticore-13b/tree/main/configs).
 ## Bias, Risks, and Limitations

 library_name: transformers
 pipeline_tag: text-generation
 ---
+<div style="width: 100%;">
+    <img src="https://i.imgur.com/EBdldam.jpg" alt="TheBlokeAI" style="width: 100%; min-width: 400px; display: block; margin: auto;">
+</div>
+<div style="display: flex; justify-content: space-between; width: 100%;">
+    <div style="display: flex; flex-direction: column; align-items: flex-start;">
+        <p><a href="https://discord.gg/UBgz4VXf">Chat & support: my new Discord server</a></p>
+    </div>
+    <div style="display: flex; flex-direction: column; align-items: flex-end;">
+        <p><a href="https://www.patreon.com/TheBlokeAI">Want to contribute? Patreon coming soon!</a></p>
+    </div>
+</div>
 # Manticore 13B Chat GGML
 Further instructions here: [text-generation-webui/docs/llama.cpp-models.md](https://github.com/oobabooga/text-generation-webui/blob/main/docs/llama.cpp-models.md).
+## Want to support my work?
+I've had a lot of people ask if they can contribute. I love providing models and helping people, but it is starting to rack up pretty big cloud computing bills.
+So if you're able and willing to contribute, it'd be most gratefully received and will help me to keep providing models, and work on various AI projects.
+Donaters will get priority support on any and all AI/LLM/model questions, and I'll gladly quantise any model you'd like to try.
+* Patreon: coming soon! (just awaiting approval)
+* Ko-Fi: https://ko-fi.com/TheBlokeAI
+* Discord: https://discord.gg/UBgz4VXf
 # Original model card - Manticore 13B Chat
+Manticore 13B Chat builds on Manticore with new datasets, including a de-duped subset of the Pygmalion dataset. It also removes all Alpaca style prompts using `###` in favor of
 chat only style prompts using `USER:`,`ASSISTANT:` as well as [pygmalion/metharme prompting](https://huggingface.co/PygmalionAI/metharme-7b#prompting) using `<|system|>, <|user|> and <|model|>` tokens.
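
A minimal sketch of the two prompt styles named above. The model card only lists the `USER:`/`ASSISTANT:` markers and the `<|system|>`, `<|user|>`, `<|model|>` tokens; the exact spacing and newlines used here, and the helper names `chat_prompt` and `metharme_prompt`, are assumptions for illustration.

```python
def chat_prompt(user_message: str) -> str:
    """Build a USER:/ASSISTANT: style prompt (spacing is an assumption)."""
    return f"USER: {user_message}\nASSISTANT:"


def metharme_prompt(system: str, user_message: str) -> str:
    """Build a metharme style prompt from the three tokens named in the card."""
    return f"<|system|>{system}<|user|>{user_message}<|model|>"


# Hypothetical example messages.
print(chat_prompt("Tell me a riddle."))
print(metharme_prompt("Enter roleplay mode.", "Hello there."))
```

Both helpers end the prompt where the model's reply is expected to begin, so the generated text can be appended directly.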
 Questions, comments, feedback, looking to donate, or want to help? Reach out on our [Discord](https://discord.gg/EqrvvehG) or email [[email protected]](mailto:[email protected])
 # Training Datasets
+Manticore 13B Chat is a Llama 13B model fine-tuned on the following datasets along with the datasets from the original Manticore 13B.
 **Manticore 13B Chat was trained on 25% of the datasets below. The datasets were merged, shuffled, and then sharded into 4 parts.**
 - de-duped pygmalion dataset, filtered down to RP data
+- [riddle_sense](https://huggingface.co/datasets/riddle_sense) - instruct augmented
 - hellaswag, updated for detailed explanations w 30K+ rows
+- [gsm8k](https://huggingface.co/datasets/gsm8k) - instruct augmented
 - [ewof/code-alpaca-instruct-unfiltered](https://huggingface.co/datasets/ewof/code-alpaca-instruct-unfiltered)
 Manticore 13B
 ## Build
+Manticore was built with [Axolotl](https://github.com/OpenAccess-AI-Collective/axolotl) on 8xA100 80GB
+ - 3 epochs taking approximately 8 hours. No further epochs will be released.
  - The configuration to duplicate this build is provided in this repo's [/config folder](https://huggingface.co/openaccess-ai-collective/manticore-13b/tree/main/configs).
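
The merge, shuffle, and shard step described under Training Datasets can be sketched roughly as follows. This is an illustration only, not the Axolotl pipeline used for the build: the helper `merge_shuffle_shard`, the seed, and the toy rows are all hypothetical.

```python
import random


def merge_shuffle_shard(datasets, num_shards=4, seed=0):
    """Merge several datasets, shuffle the rows, and split them into shards."""
    merged = [row for dataset in datasets for row in dataset]
    rng = random.Random(seed)  # fixed seed so the split is reproducible
    rng.shuffle(merged)
    # Striding yields shards whose sizes differ by at most one row.
    return [merged[i::num_shards] for i in range(num_shards)]


# Hypothetical toy rows standing in for the real datasets.
toy_datasets = [
    [f"pygmalion-{i}" for i in range(6)],
    [f"riddle_sense-{i}" for i in range(4)],
    [f"gsm8k-{i}" for i in range(2)],
]

shards = merge_shuffle_shard(toy_datasets)
train_rows = shards[0]  # one shard of four is about 25% of the merged rows
print(len(train_rows), "of", sum(len(s) for s in shards), "rows")
```

Training on a single shard of a shuffled merge keeps the mix of sources roughly proportional to the full set while using a quarter of the rows.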
 ## Bias, Risks, and Limitations