Update README.md
Browse files
README.md
CHANGED
@@ -52,14 +52,16 @@ llama-server -m base_model.gguf --lora html-model-tinyllama-chat-bnb-4bit-f32.gg
 
 
 ## Use python script
+### Install llama.cpp
 ```bash
 pip install llama-cpp-python
 ```
+### Python script to run the model
 ```python
 from llama_cpp import Llama
 
 # Replace with the actual path to your downloaded GGUF file
-model_path = "/path/to/your/downloaded/html-model-tinyllama-chat-bnb-4bit-F32-GGUF"
+model_path = "/path/to/your/downloaded/html-model-tinyllama-chat-bnb-4bit-F32-GGUF.gguf"
 
 llm = Llama(model_path=model_path)
 