KnutJaegersberg/Deacon-34B-200k

This model uses the yi_tokenizer; loading it requires trust_remote_code=True.
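A minimal sketch of loading the tokenizer with the Hugging Face `transformers` library, assuming that library is installed and that the repository id matches the model name in the title. The helper name `load_tokenizer` is hypothetical, used here only for illustration.

```python
# Repository id taken from the model card title.
MODEL_ID = "KnutJaegersberg/Deacon-34B-200k"


def load_tokenizer(model_id: str = MODEL_ID):
    """Load the custom Yi tokenizer for this model (hypothetical helper).

    trust_remote_code=True lets transformers run the tokenizer code
    shipped inside the model repository, so review that code first.
    """
    # Imported lazily so this module can be imported without transformers.
    from transformers import AutoTokenizer

    return AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
```

Calling `load_tokenizer()` downloads the tokenizer files on first use. Because `trust_remote_code=True` executes Python code from the repository, it is worth inspecting that code before enabling the flag.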

Introduction

The Yi series models are large language models trained from scratch by developers at 01.AI. The first public release contains two bilingual (English/Chinese) base models with 6B (Yi-6B) and 34B (Yi-34B) parameters. Both are trained with a 4K sequence length, which can be extended to 32K at inference time.

License

The Yi series models are fully open for academic research and free for commercial use with permission obtained by application. All usage must adhere to the Model License Agreement 2.0. To apply for the official commercial license, please contact us ([email protected]).

Prompt Example:

### System:

You are an AI assistant. User will give you a task. Your goal is to complete the task as faithfully as you can. While performing the task think step-by-step and justify your steps.

### Instruction: 

How do you fine tune a large language model? 

### Response:
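The template above can be assembled programmatically. The following is a sketch only: the helper name `build_prompt` is hypothetical, and the exact whitespace between sections (a single newline here) is an assumption, since the model card does not specify it.

```python
# Default system message copied from the prompt example in this card.
DEFAULT_SYSTEM = (
    "You are an AI assistant. User will give you a task. "
    "Your goal is to complete the task as faithfully as you can. "
    "While performing the task think step-by-step and justify your steps."
)


def build_prompt(instruction: str, system: str = DEFAULT_SYSTEM) -> str:
    """Format a prompt in the System / Instruction / Response layout.

    Assumption: sections are separated by single newlines; adjust this
    if the model responds better to a different spacing.
    """
    return (
        f"### System:\n{system.strip()}\n"
        f"### Instruction:\n{instruction.strip()}\n"
        "### Response:\n"
    )
```

For example, `build_prompt("How do you fine tune a large language model?")` yields a string ending in `### Response:` followed by a newline, after which the model is expected to generate its answer.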

