@erikkaum on Hugging Face: "ZML just released a technical preview of their new Inference Engine: LLMD. -…"

Hugging Face

Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Back to feed

erikkaum

posted an update 17 days ago

Post

2543

ZML just released a technical preview of their new Inference Engine: LLMD.

- Just 2.4GB container, which means fast startup times and efficient autoscaling
- Cross-Platform GPU Support: works on both NVIDIA and AMD GPUs.
- written in Zig

I just tried it out and deployed it on Hugging Face Inference Endpoints and wrote a quick guide 👇 You can try it in like 5 minutes!

https://huggingface.co/blog/erikkaum/test-driving-llmd-inference-engine

AtAndDev

17 days ago

Zig ftw

In this post

erikkaum Erik Kaunismäki
AtAndDev alkinun