phi-3-gguf

phi-3-gguf is a GGUF Q4_K_M (int4) quantized version of Microsoft Phi-3-mini-instruct. The 4-bit quantization gives a small, fast inference implementation optimized for AI PCs.

Model Description

  • Developed by: microsoft
  • Model type: phi3
  • Parameters: 3.8 billion
  • Model Parent: microsoft/Phi-3-mini-instruct
  • Language(s) (NLP): English
  • License: Apache 2.0
  • Uses: Chat, general-purpose LLM
  • Quantization: Q4_K_M (int4)
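
To give a rough sense of what int4 quantization means for a 3.8 billion parameter model, the sketch below estimates the size of the weights at a few bit widths. It is a back-of-the-envelope calculation, not a measurement of this file: the helper `estimate_weight_size_gb` is a hypothetical name, and the 4.8 bits-per-weight figure is an assumed average for a mixed-precision 4-bit scheme, not a number taken from this model card. The actual GGUF file also carries metadata and may differ.

```python
def estimate_weight_size_gb(num_params: float, bits_per_weight: float) -> float:
    """Approximate weight storage in gigabytes (10**9 bytes)."""
    return num_params * bits_per_weight / 8 / 1e9


PARAMS = 3.8e9  # parameter count stated in the model description

# Bit widths to compare. The last value is an assumption for illustration only.
SCENARIOS = [
    ("fp16 baseline", 16.0),
    ("int4 nominal", 4.0),
    ("assumed 4-bit mixed average", 4.8),
]

for label, bits in SCENARIOS:
    size_gb = estimate_weight_size_gb(PARAMS, bits)
    print(f"{label}: ~{size_gb:.2f} GB")
```

Under these assumptions the quantized weights come out at roughly a quarter to a third of the fp16 size, which is what makes the model practical on laptop-class hardware.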

Model Card Contact

llmware on hf

llmware website
