llmware
/

dragon-yi-answer-tool

Inference Endpoints

Model card Files Files and versions Community

dragon-yi-answer-tool / README.md

doberst's picture

Update README.md

d11e664 verified about 1 year ago

|

1.11 kB

	---
	license: apache-2.0
	---

	# Model Card for Model ID

	<!-- Provide a quick summary of what the model is/does. -->

	dragon-yi-qa-tool is a Q4_K_M GGUF quantized version of the DRAGON Yi model series, providing a fast, small inference implementation.


	Load in your favorite GGUF inference engine, or try with llmware as follows:

	from llmware.models import ModelCatalog

	qa_tool = ModelCatalog().load_model("llmware/dragon-yi-qa-tool")
	response = qa_tool.inference(question, text_sample)


	### Model Description

	<!-- Provide a longer summary of what this model is. -->

	- Developed by: llmware
	- Model type: GGUF
	- Language(s) (NLP): English
	- License: Yi Community License
	- Quantized from model: llmware/dragon-yi-6b (finetuned yi-6b-base)

	## Uses

	<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->

	All of the DRAGON models use the following prompt template:

	"<human> " + text + "\n" + question + "\n<bot>: "


	## Model Card Contact

	Darren Oberst & llmware team