Novaciano
/

La_Mejor_Mezcla-3.2-1B-Q8_0-GGUF

@@ -35,11 +35,11 @@ license: apache-2.0
 ---
 # Novaciano/La_Mejor_Mezcla-3.2-1B-Q8_0-GGUF
-This model was converted to GGUF format from [`Novaciano/La_Mejor_Mezcla-3.2-1B`](https://huggingface.co/Novaciano/La_Mejor_Mezcla-3.2-1B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
-Refer to the [original model card](https://huggingface.co/Novaciano/La_Mejor_Mezcla-3.2-1B) for more details on the model.
-## Use with llama.cpp
-Install llama.cpp through brew (works on Mac and Linux)
 ```bash
 brew install llama.cpp
@@ -57,23 +57,23 @@ llama-cli --hf-repo Novaciano/La_Mejor_Mezcla-3.2-1B-Q8_0-GGUF --hf-file la_mejo
 llama-server --hf-repo Novaciano/La_Mejor_Mezcla-3.2-1B-Q8_0-GGUF --hf-file la_mejor_mezcla-3.2-1b-q8_0.gguf -c 2048
 ```
-Note: You can also use this checkpoint directly through the [usage steps](https://github.com/ggerganov/llama.cpp?tab=readme-ov-file#usage) listed in the Llama.cpp repo as well.
-Step 1: Clone llama.cpp from GitHub.
 ```
 git clone https://github.com/ggerganov/llama.cpp
 ```
-Step 2: Move into the llama.cpp folder and build it with `LLAMA_CURL=1` flag along with other hardware-specific flags (for ex: LLAMA_CUDA=1 for Nvidia GPUs on Linux).
 ```
 cd llama.cpp && LLAMA_CURL=1 make
 ```
-Step 3: Run inference through the main binary.
 ```
 ./llama-cli --hf-repo Novaciano/La_Mejor_Mezcla-3.2-1B-Q8_0-GGUF --hf-file la_mejor_mezcla-3.2-1b-q8_0.gguf -p "The meaning to life and the universe is"
 ```
-or
 ```
 ./llama-server --hf-repo Novaciano/La_Mejor_Mezcla-3.2-1B-Q8_0-GGUF --hf-file la_mejor_mezcla-3.2-1b-q8_0.gguf -c 2048
 ```

 ---
 # Novaciano/La_Mejor_Mezcla-3.2-1B-Q8_0-GGUF
+Este modelo se convirtió al formato GGUF desde [`Novaciano/La_Mejor_Mezcla-3.2-1B`](https://huggingface.co/Novaciano/La_Mejor_Mezcla-3.2-1B) utilizando llama.cpp a través del espacio [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) de ggml.ai.
+Consulta la [tarjeta del modelo original](https://huggingface.co/Novaciano/La_Mejor_Mezcla-3.2-1B) para obtener más detalles sobre el modelo.
+## Uso con llama.cpp
+Instalar llama.cpp a través de brew (funciona en Mac y Linux)
 ```bash
 brew install llama.cpp
 llama-server --hf-repo Novaciano/La_Mejor_Mezcla-3.2-1B-Q8_0-GGUF --hf-file la_mejor_mezcla-3.2-1b-q8_0.gguf -c 2048
 ```
+**Nota:** También puedes usar este punto de control directamente a través de los [pasos de uso](https://github.com/ggerganov/llama.cpp?tab=readme-ov-file#usage) que se enumeran en el repositorio Llama.cpp.
+**Paso 1:** Clona llama.cpp desde GitHub.
 ```
 git clone https://github.com/ggerganov/llama.cpp
 ```
+**Paso 2:** Vaya a la carpeta llama.cpp y compílela con el indicador `LLAMA_CURL=1` junto con otros indicadores específicos del hardware (por ejemplo: LLAMA_CUDA=1 para GPU Nvidia en Linux).
 ```
 cd llama.cpp && LLAMA_CURL=1 make
 ```
+**Paso 3:** Ejecutar la inferencia a través del binario principal.
 ```
 ./llama-cli --hf-repo Novaciano/La_Mejor_Mezcla-3.2-1B-Q8_0-GGUF --hf-file la_mejor_mezcla-3.2-1b-q8_0.gguf -p "The meaning to life and the universe is"
 ```
+o
 ```
 ./llama-server --hf-repo Novaciano/La_Mejor_Mezcla-3.2-1B-Q8_0-GGUF --hf-file la_mejor_mezcla-3.2-1b-q8_0.gguf -c 2048
 ```