LLM - GGUF
Collection
Text Generations Models in GGUF format, hand picked by Nexa Team.
β’
4 items
β’
Updated
β’
2
Run them directly with nexa-sdk installed In nexa-sdk CLI:
NexaAI/gpt-oss-20b-GGUF
This is a GGUF version of the OpenAI GPT OSS 20B model, for lower latency, and local or specialized use cases (21B parameters with 3.6B active parameters).
Original model card: ggml-org/gpt-oss-20b-GGUF