---
license: other
language:
- en
---

[ExLlamaV2](https://github.com/turboderp/exllamav2/tree/master#exllamav2) models of [Gryphe's MythoMax L2 13B](https://huggingface.co/Gryphe/MythoMax-L2-13b).

Other quantized models are available from TheBloke: [GGML](https://huggingface.co/TheBloke/MythoMax-L2-13B-GGML) - [GPTQ](https://huggingface.co/TheBloke/MythoMax-L2-13B-GPTQ) - [GGUF](https://huggingface.co/TheBloke/MythoMax-L2-13B-GGUF) - [AWQ](https://huggingface.co/TheBloke/MythoMax-L2-13B-AWQ)

## Model details

| Branch | Bits | Perplexity (lower is better) | Notes |
|:--------------------------------------------------------------------:|:----:|:----------------------------:|:--------------------------------------------------:|
| [main](https://huggingface.co/R136a1/MythoMax-L2-13B-exl2/tree/main) | 5 | not recorded | First attempt; not sure why I made this one |
| | 4 | | |
| | 6.5 | 6.1074 | Can run a 4096-token context on a T4 GPU |
| | 7 | 6.1056 | 2048-token max context on a T4 GPU |
| | 8 | 6.1027 | Just, why? |

To be updated.

## Prompt Format

This model primarily uses Alpaca formatting, so for optimal model performance, use:

```
### Instruction:
Your instruction or question here.
For roleplay purposes, I suggest the following - Write {{char}}'s next reply in a chat between {{char}} and {{user}}. Write a single reply only.

### Response:
```
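## Usage sketch

For reference, below is a minimal sketch of loading one of these quants with exllamav2's Python API and generating from an Alpaca-style prompt. The model path, context length, sampler values, and instruction text are illustrative placeholders rather than values from this card, and the calls follow the exllamav2 example scripts, so they may differ slightly across library versions.

```python
from exllamav2 import ExLlamaV2, ExLlamaV2Config, ExLlamaV2Cache, ExLlamaV2Tokenizer
from exllamav2.generator import ExLlamaV2BaseGenerator, ExLlamaV2Sampler

# Assumed local path: a branch of this repo downloaded ahead of time (hypothetical).
model_dir = "./MythoMax-L2-13B-exl2"

config = ExLlamaV2Config()
config.model_dir = model_dir
config.prepare()
config.max_seq_len = 4096  # per the table above, the 6.5-bit quant fits 4096 tokens on a T4

model = ExLlamaV2(config)
cache = ExLlamaV2Cache(model, lazy=True)  # allocate the cache lazily, then split across GPUs
model.load_autosplit(cache)

tokenizer = ExLlamaV2Tokenizer(config)
generator = ExLlamaV2BaseGenerator(model, cache, tokenizer)

# Alpaca-style prompt, matching the format described above.
prompt = (
    "### Instruction:\n"
    "Your instruction or question here.\n\n"
    "### Response:\n"
)

settings = ExLlamaV2Sampler.Settings()
settings.temperature = 0.8  # illustrative value, not a recommendation from this card
settings.top_p = 0.9        # illustrative value, not a recommendation from this card

generator.warmup()
output = generator.generate_simple(prompt, settings, 200)  # generate up to 200 new tokens
print(output)
```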