MLX
Safetensors
English
qwen2
Isaak Carter Augustus
619a190a02eb7ecef91bd4c7e2a37b1f5f1b9649492495b1d9bf334b658760b1
73e2c98 verified
|
raw
history blame
817 Bytes
---
language:
- en
license: apache-2.0
tags:
- mlx
datasets:
- Locutusque/hercules-v5.0
inference:
parameters:
do_sample: true
temperature: 0.8
top_p: 0.95
top_k: 40
min_p: 0.1
max_new_tokens: 250
repetition_penalty: 1.1
---
# mlx-community/Hercules-5.0-Qwen2-1.5B-4bits
The Model [mlx-community/Hercules-5.0-Qwen2-1.5B-4bits](https://huggingface.co/mlx-community/Hercules-5.0-Qwen2-1.5B-4bits) was converted to MLX format from [M4-ai/Hercules-5.0-Qwen2-1.5B](https://huggingface.co/M4-ai/Hercules-5.0-Qwen2-1.5B) using mlx-lm version **0.14.0**.
## Use with mlx
```bash
pip install mlx-lm
```
```python
from mlx_lm import load, generate
model, tokenizer = load("mlx-community/Hercules-5.0-Qwen2-1.5B-4bits")
response = generate(model, tokenizer, prompt="hello", verbose=True)
```