Based on https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v1.0

Converted to an ONNX model using https://github.com/microsoft/onnxruntime-genai

Command used (run from the root of an onnxruntime-genai checkout): python src/python/py/models/builder.py -m TinyLlama/TinyLlama-1.1B-Chat-v1.0 -o path-to-onnx-model -p int4 -e dml
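
For readers who want to script the conversion, here is a minimal sketch that assembles the same builder invocation as an argument list. The helper name `build_builder_command` and its defaults are hypothetical and only for illustration. The flags are the ones from the command above: `-m` is the Hugging Face model ID, `-o` the output folder, `-p` the precision (int4), and `-e` the execution provider (`dml`, i.e. DirectML). The script path assumes the current directory is the root of an onnxruntime-genai checkout.

```python
import shlex
import sys


def build_builder_command(
    model_id,
    output_dir,
    precision="int4",
    execution_provider="dml",
    builder_script="src/python/py/models/builder.py",
):
    """Return the builder.py invocation as an argument list.

    Hypothetical helper: it only mirrors the flags shown in the
    command above (-m, -o, -p, -e) and does not run anything.
    """
    return [
        sys.executable,            # the current Python interpreter
        builder_script,            # path relative to the onnxruntime-genai checkout
        "-m", model_id,            # Hugging Face model ID
        "-o", output_dir,          # folder that receives the ONNX model
        "-p", precision,           # quantization / precision
        "-e", execution_provider,  # target execution provider
    ]


if __name__ == "__main__":
    cmd = build_builder_command(
        "TinyLlama/TinyLlama-1.1B-Chat-v1.0",
        "path-to-onnx-model",  # placeholder output path, as in the original command
    )
    # Print the command for inspection. To actually run the conversion,
    # pass the list to subprocess.run(cmd, check=True) from the checkout root.
    print(shlex.join(cmd))
```

Building the command as a list rather than a single string avoids shell-quoting problems if the output path contains spaces.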
