|
--- |
|
license: apache-2.0 |
|
--- |
|
|
|
# Model Card for Model ID |
|
|
|
<!-- Provide a quick summary of what the model is/does. --> |
|
|
|
**dragon-yi-qa-tool** is a Q4_K_M GGUF quantized version of the DRAGON Yi model series, providing a fast, small inference implementation. |
|
|
|
|
|
Load in your favorite GGUF inference engine, or try with llmware as follows: |
|
|
|
from llmware.models import ModelCatalog |
|
|
|
qa_tool = ModelCatalog().load_model("llmware/dragon-yi-qa-tool") |
|
response = qa_tool.inference(question, text_sample) |
|
|
|
|
|
### Model Description |
|
|
|
<!-- Provide a longer summary of what this model is. --> |
|
|
|
- **Developed by:** llmware |
|
- **Model type:** GGUF |
|
- **Language(s) (NLP):** English |
|
- **License:** Yi Community License |
|
- **Quantized from model:** llmware/dragon-yi-6b (finetuned yi-6b-base) |
|
|
|
## Uses |
|
|
|
<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. --> |
|
|
|
All of the DRAGON models use the following prompt template: |
|
|
|
"<human> " + text + "\n" + question + "\n<bot>: " |
|
|
|
|
|
## Model Card Contact |
|
|
|
Darren Oberst & llmware team |
|
|
|
|
|
|