README.md · NousResearch/DeepHermes-3-Llama-3-3B-Preview-GGUF at cf716d495c0aa3fe77a1eadebae009d167e02062

Upload folder using huggingface_hub

cf716d4 verified about 1 month ago

702 Bytes

	# GGUF Quantized Models

	This repository contains GGUF quantized versions of language models for efficient inference.

	## Files

	The following GGUF files are available in this repository:

	- `DeepHermes-3-Llama-3-3B-Preview-q4.gguf` (1.8G)
	- `DeepHermes-3-Llama-3-3B-Preview-q5.gguf` (2.2G)
	- `DeepHermes-3-Llama-3-3B-Preview-q6.gguf` (2.5G)
	- `DeepHermes-3-Llama-3-3B-Preview-q8.gguf` (3.2G)

	## Usage

	These models are compatible with [llama.cpp](https://github.com/ggerganov/llama.cpp) and similar frameworks.

	Example usage with llama.cpp:
	```bash
	./main -m /path/to/model.gguf -p "Hello, I am a language model" -n 128
	```

	## Upload Information
	Files were uploaded on Tue Mar 11 04:28:49 PDT 2025