Qualcomm NPU
Collection
Latest SOTA models supported on Qualcomm NPU.
•
8 items
•
Updated
Qwen3-1.7B is a 1.7-billion-parameter general-purpose language model from the Qwen team at Alibaba Cloud.
Part of the Qwen3 series, it is designed to deliver strong language understanding, reasoning, and generation while being small enough for efficient deployment on a wider range of devices.
Trained on a large, high-quality multilingual dataset, Qwen3-1.7B supports diverse natural language processing tasks and can be fine-tuned for specialized domains.
Input:
Output:
⚠️ Hardware requirement: the model currently runs only on Qualcomm NPUs (e.g., Snapdragon-powered AIPC).
Apple NPU support is planned next.
Create a token in the Model Hub, then log in:
nexa config set license '<access_token>'
Running:
nexa infer NexaAI/qwen3-1.7B-npu