Photolens/llama-2-7b-langchain-chat, converted to GGUF format.

Model Overview

Model license: Llama-2
This model is based on NousResearch/Llama-2-7b-chat-hf, fine-tuned with QLoRA on the Photolens/oasst1-langchain-llama-2-formatted dataset.

Prompt Template: Llama-2

<s>[INST] Prompter Message [/INST] Assistant Message </s>
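A minimal sketch of how the template above could be assembled in code. The helper names `format_turn` and `format_conversation` are hypothetical and not part of this model or any library; the spacing follows the template line exactly as written. Some runtimes add the `<s>` token themselves during tokenization, in which case the `add_bos` flag (also an assumption of this sketch) can be set to `False`.

```python
from typing import List, Optional, Tuple

# Special markers taken from the template line in this card.
BOS, EOS = "<s>", "</s>"
B_INST, E_INST = "[INST]", "[/INST]"


def format_turn(
    prompter_message: str,
    assistant_message: Optional[str] = None,
    add_bos: bool = True,
) -> str:
    """Format one exchange in the Llama-2 style shown above.

    If assistant_message is None, the string ends after [/INST] so the
    model can generate the assistant reply.
    """
    prefix = BOS if add_bos else ""
    text = f"{prefix}{B_INST} {prompter_message.strip()} {E_INST}"
    if assistant_message is not None:
        text += f" {assistant_message.strip()} {EOS}"
    return text


def format_conversation(
    history: List[Tuple[str, str]],
    next_message: str,
) -> str:
    """Concatenate completed turns, then append the open prompter turn."""
    parts = [format_turn(user, assistant) for user, assistant in history]
    parts.append(format_turn(next_message))
    return "".join(parts)


if __name__ == "__main__":
    print(format_turn("Prompter Message", "Assistant Message"))
    print(format_conversation([("Hi", "Hello!")], "What is GGUF?"))
```

The string returned for an open turn can be passed as the prompt to whichever GGUF runtime is in use, with `</s>` as a natural stop sequence.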

Intended Use

The dataset used to fine-tune the base model is optimized for LangChain applications.

GGUF

Model size: 6.74B params
Architecture: llama
Quantizations: 4-bit, 5-bit, 6-bit, 8-bit
