Edit model card

Convert from TinyLlama/TinyLlama-1.1B-Chat-v1.0 and 4 bits quantized.

Require onnxruntime>=0.17.0

Downloads last month
7
Inference Examples
Inference API (serverless) is not available, repository is disabled.

Model tree for BricksDisplay/TinyLlama-1.1B-Chat-v1.0-q4

Quantized
this model

Collection including BricksDisplay/TinyLlama-1.1B-Chat-v1.0-q4