Upload README.md with huggingface_hub
README.md
CHANGED
@@ -43,7 +43,7 @@ More details on model performance across various devices, can be found
 
 | Model | Device | Chipset | Target Runtime | Response Rate (Tokens/Second) | Time To First Token Range (Seconds) | Tiny MMLU
 |---|---|---|---|---|---|---|
-| Mistral-7B-Instruct-v0_3 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | QNN | 10.73 | 0.18 - 5.79 |
+| Mistral-7B-Instruct-v0_3 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | QNN | 10.73 | 0.18 - 5.79 | 58.85% | Use Export Script |
 
 ## Deploying Mistral 7B Instruct v3.0 on-device
 Please follow [this tutorial](https://github.com/quic/ai-hub-apps/tree/main/tutorials/llama)