NexaAI
/

Qwen3-4B-Thinking-2507-npu

Model card Files Files and versions

zackli4ai commited on 19 days ago

Commit

06d03b1

·

verified ·

1 Parent(s): 36ff29f

Update README.md

Files changed (1) hide show

README.md +29 -0

README.md CHANGED Viewed

@@ -24,6 +24,35 @@
 - Structured chain-of-thought (with `<think>…</think>` tags), followed by final answer or solution.
 - Note: The default template auto-inserts thinking behavior, so you may see only a closing `</think>` tag.
 ## License
 - Licensed under **Apache-2.0**

 - Structured chain-of-thought (with `<think>…</think>` tags), followed by final answer or solution.
 - Note: The default template auto-inserts thinking behavior, so you may see only a closing `</think>` tag.
+---
+## How to use
+> ⚠️ **Hardware requirement:** the model currently runs **only on Qualcomm NPUs** (e.g., Snapdragon-powered AIPC).
+> Apple NPU support is planned next.
+### 1) Install Nexa-SDK
+- Download and follow the steps under "Deploy Section" Nexa's model page:  [Download Windows arm64 SDK](https://sdk.nexa.ai/model/Qwen3-4B-Thinking-2507)
+- (Other platforms coming soon)
+### 2) Get an access token
+Create a token in the Model Hub, then log in:
+```bash
+nexa config set license '<access_token>'
+```
+### 3) Run the model
+Running:
+```bash
+nexa infer NexaAI/Qwen3-4B-Thinking-2507-npu
+```
+---
 ## License
 - Licensed under **Apache-2.0**