eagle0504 committed on
Commit aeada4e · verified · 1 Parent(s): c43c4a1

Update README.md

Files changed (1): README.md +78 -1
README.md CHANGED
@@ -40,7 +40,84 @@ You will need:
 - Together.AI API Key
 - Unsloth package

- ## How to Use Model for Inference
+ ## How to Use Model via Terminal (Mac)
+
+ **Goal:** Run Qwen-2.5-3B Instruct on your Mac using `llama.cpp`.
+
+ This is a step-by-step guide, assuming you are starting from a clean macOS installation with only `pyenv` installed.
+
+ ### **Step 1: Install Homebrew (if not installed)**
+ Homebrew is required to install `llama.cpp`.
+
+ 1. Open **Terminal** (`Cmd + Space`, type `Terminal`, and press **Enter**).
+ 2. Run:
+ ```sh
+ /bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)"
+ ```
+ 3. After installation, add Homebrew to your PATH:
+ ```sh
+ echo 'eval "$(/opt/homebrew/bin/brew shellenv)"' >> ~/.zprofile
+ eval "$(/opt/homebrew/bin/brew shellenv)"
+ ```
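+
+ You can confirm Homebrew is on your PATH with a quick version check:
+ ```sh
+ # Prints the installed Homebrew version if the PATH setup above worked
+ brew --version
+ ```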
+
+ ---
+
+ ### **Step 2: Install `llama.cpp` via Homebrew**
+ Run:
+ ```sh
+ brew install llama.cpp
+ ```
+
+ Once installed, you should be able to use `llama-cli`.
+
+ ---
+
+ ### **Step 3: Run Qwen-2.5-3B Instruct with `llama-cli`**
+ To run the model, execute:
+ ```sh
+ llama-cli -hf eagle0504/qwen-2-5-3b-instruct-using-openai-gsm8k-gguf-data-enhanced-with-deepseek-v3-small:Q8_0
+ ```
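+
+ The `-hf` flag downloads the GGUF file from Hugging Face and caches it locally on first use. You can combine it with generation flags in the same call; something like the following should work on recent `llama.cpp` builds (the prompt and token limit here are just examples):
+ ```sh
+ # One-shot prompt: -p sets the prompt, -n caps the number of generated tokens
+ llama-cli -hf eagle0504/qwen-2-5-3b-instruct-using-openai-gsm8k-gguf-data-enhanced-with-deepseek-v3-small:Q8_0 \
+   -p "What is 17 * 24?" -n 128
+ ```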
+
+ ---
+
+ ### **Step 4: Additional Configurations (If Needed)**
+ If you encounter issues or need finer control, you may want to:
+
+ #### **A. Verify Installation**
+ Check if `llama-cli` is installed:
+ ```sh
+ llama-cli --version
+ ```
+ If you see a version output, it's installed correctly.
+
+ #### **B. Run with Explicit Model Path**
+ If the default Hugging Face loader doesn't work, you can manually download the model:
+ 1. **Create a models directory:**
+ ```sh
+ mkdir -p ~/llama_models && cd ~/llama_models
+ ```
+ 2. **Download the GGUF model file** from [Hugging Face](https://huggingface.co/eagle0504/qwen-2-5-3b-instruct-using-openai-gsm8k-gguf-data-enhanced-with-deepseek-v3-small) (if `wget` is unavailable, see the `curl` alternative after this list):
+ ```sh
+ wget https://huggingface.co/eagle0504/qwen-2-5-3b-instruct-using-openai-gsm8k-gguf-data-enhanced-with-deepseek-v3-small/resolve/main/Q8_0.gguf
+ ```
+ 3. **Run the model manually**:
+ ```sh
+ llama-cli -m ~/llama_models/Q8_0.gguf
+ ```
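+
+ Note that macOS does not ship with `wget` by default. If step 2 above fails with `command not found`, `curl` (which is preinstalled) can fetch the same file:
+ ```sh
+ # -L follows Hugging Face's redirect; -o names the output file
+ curl -L -o Q8_0.gguf https://huggingface.co/eagle0504/qwen-2-5-3b-instruct-using-openai-gsm8k-gguf-data-enhanced-with-deepseek-v3-small/resolve/main/Q8_0.gguf
+ ```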
+
+ ---
+
+ ### **Step 5: Test the Model**
+ Try prompting it:
+ ```sh
+ llama-cli -m ~/llama_models/Q8_0.gguf -p "Explain quantum computing in simple terms."
+ ```
+ or interactively:
+ ```sh
+ llama-cli -m ~/llama_models/Q8_0.gguf --interactive
+ ```
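+
+ Generation can be tuned with the standard `llama.cpp` sampling flags; the values below are illustrative, not tuned for this model:
+ ```sh
+ # -n limits generated tokens, --temp controls randomness, -c sets the context window
+ llama-cli -m ~/llama_models/Q8_0.gguf \
+   -p "Explain quantum computing in simple terms." \
+   -n 256 --temp 0.7 -c 4096
+ ```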
+
+ ## How to Use Model via Python

 You can load this model with `transformers`: