MLX
Safetensors
English
qwen2
Isaak Carter Augustus
619a190a02eb7ecef91bd4c7e2a37b1f5f1b9649492495b1d9bf334b658760b1
73e2c98 verified
|
raw
history blame
817 Bytes
metadata
language:
  - en
license: apache-2.0
tags:
  - mlx
datasets:
  - Locutusque/hercules-v5.0
inference:
  parameters:
    do_sample: true
    temperature: 0.8
    top_p: 0.95
    top_k: 40
    min_p: 0.1
    max_new_tokens: 250
    repetition_penalty: 1.1

mlx-community/Hercules-5.0-Qwen2-1.5B-4bits

The Model mlx-community/Hercules-5.0-Qwen2-1.5B-4bits was converted to MLX format from M4-ai/Hercules-5.0-Qwen2-1.5B using mlx-lm version 0.14.0.

Use with mlx

pip install mlx-lm
from mlx_lm import load, generate

model, tokenizer = load("mlx-community/Hercules-5.0-Qwen2-1.5B-4bits")
response = generate(model, tokenizer, prompt="hello", verbose=True)