🚀 NEW DROP: run your own on-device LLM—in minutes, on any phone Today we’re open-sourcing everything you need to put Qwen3-0.6B straight into a production-ready mobile app:
🎥 Watch Qwen3-0.6B chat in real time on any smartphones!
📊 TPS benchmarks – slides comparing token-per-second across heterogeneous mobile devices
💻 Plug-and-play source – Just Copy & Run the source to your project for Android (Kotlin & Java) and iOS (Swift).
🤞 Cross-platform, one pipeline – ZETIC.MLange auto-tunes kernels for every different devices, we’ve tested.
👨💻 Ready for production – swap in your own model, re-benchmark with one command, publish.
We built this to show that cloud-free LLMs are ready today. Dive in, fork it, and tag ZETIC.ai when you launch your own on-device assistant, game NPC, or offline content generator—we’ll spotlight the best projects.