# Dante-7B (GGUF) – Optimized for Ollama

This repository provides a **quantized and GGUF-converted** version of the [Dante-7B](https://huggingface.co/outflanknl/Dante-7B) model, based on the **Qwen2 7B** architecture.
It is optimized for use with [Ollama](https://ollama.ai) or any backend compatible with [`llama.cpp`](https://github.com/ggerganov/llama.cpp).

---

## Model Origin

- **Base model**: [`outflanknl/Dante-7B`](https://huggingface.co/outflanknl/Dante-7B)
- **Architecture**: Qwen2 7B
- **Format conversion**: Performed with the official `llama.cpp` conversion script:

```bash
python3 convert_hf_to_gguf.py /path/to/original/model --outfile Dante-7B.gguf
```

---

## Quantization

The model has been **quantized** to reduce memory usage and improve inference speed while keeping high-quality outputs.
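
The repository does not state which quantization level was used. As an illustration only, a converted GGUF file can be quantized with `llama.cpp`'s `llama-quantize` tool; the `Q4_K_M` level below is an assumed, commonly used choice, not necessarily the one shipped here:

```shell
# Illustrative sketch: quantize a converted GGUF with llama.cpp's llama-quantize.
# Q4_K_M is an assumption; the level actually used for this repo is not stated.
in=Dante-7B.gguf
out=Dante-7B-Q4_K_M.gguf
if command -v llama-quantize >/dev/null 2>&1 && [ -f "$in" ]; then
  llama-quantize "$in" "$out" Q4_K_M
else
  echo "skipping: llama-quantize or $in not available"
fi
```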

---

## Files in This Repository

- `Dante-7B.gguf` – Ready-to-use model file (GGUF format)
- Example **`Modelfile`** for Ollama

---

## Quick Start with Ollama

### 1. Download the repository

```bash
git lfs install
git clone https://huggingface.co/ganchito/dante-7b.gguf
cd dante-7b.gguf
```

### 2. Create a `Modelfile`

Example configuration:

```dockerfile
FROM Dante-7B.gguf

# Model configuration
PARAMETER stop "<|im_end|>"
PARAMETER stop "<|endoftext|>"
PARAMETER stop "<|im_start|>"

SYSTEM """You are Dante, a 7B parameter language model based on the Qwen2 architecture.
You are a helpful, creative, and intelligent AI assistant.
You can engage in conversations, answer questions, help with tasks, and provide thoughtful responses.
Always be respectful, honest, and helpful while maintaining a conversational and engaging tone."""

TEMPLATE """{{ if .System }}<|im_start|>system
{{ .System }}<|im_end|>
{{ end }}{{ if .Prompt }}<|im_start|>user
{{ .Prompt }}<|im_end|>
{{ end }}<|im_start|>assistant
{{ .Response }}<|im_end|>"""

PARAMETER temperature 0.7
PARAMETER top_p 0.9
PARAMETER top_k 40
PARAMETER repeat_penalty 1.1
PARAMETER num_ctx 32768
```
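
For reference, the `TEMPLATE` above emits ChatML-style turns. A minimal sketch of what a single system + user exchange expands to before the model's response (the system and user strings are placeholders, not part of Ollama):

```shell
# Sketch of the ChatML prompt the TEMPLATE above produces for one exchange.
system="You are Dante, a helpful assistant."
prompt="Hello!"
rendered="<|im_start|>system
${system}<|im_end|>
<|im_start|>user
${prompt}<|im_end|>
<|im_start|>assistant
"
printf '%s' "$rendered"
```

This is Qwen2's ChatML format, which is why the `stop` parameters above include `<|im_end|>`.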

### 3. Build the model in Ollama

```bash
ollama create dante-7b -f Modelfile
```

### 4. Run the model

```bash
ollama run dante-7b
```
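
Besides the interactive CLI, a running Ollama server also exposes a local REST API (default port 11434). A hedged sketch of a non-streaming request (the prompt text is a placeholder):

```shell
# Query the model via Ollama's local HTTP API instead of `ollama run`.
# Skips gracefully when no server is listening on the default port.
body='{"model": "dante-7b", "prompt": "Hello, Dante!", "stream": false}'
if curl -sf http://localhost:11434/api/version >/dev/null 2>&1; then
  curl -s http://localhost:11434/api/generate -d "$body"
else
  echo "skipping: no Ollama server on localhost:11434"
fi
```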

---

## License

This GGUF version is subject to the **same license** as the original [Dante-7B](https://huggingface.co/outflanknl/Dante-7B) model.