Model Card for sijiasijia/lora-backward-1000

7374-course-llm

Model Description

LoRA Backward Model (1000 samples)

This model is a LoRA-finetuned version of NousResearch/Llama-2-7b-hf, trained to predict the instruction (x) given the assistant response (y). This implements the backward model training from the paper:

Self-Alignment with Instruction Backtranslation
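As a rough illustration of how the adapter might be used at inference time, the sketch below attaches the LoRA weights to the base model with `peft` and asks the model to continue a prompt that ends at the instruction header. The prompt layout in `build_backward_prompt` is an assumption based on the template shown in the Dataset section; the helper names, the generation settings, and the use of `device_map="auto"` (which needs `accelerate`) are illustrative choices, not the author's confirmed setup.

```python
BASE_MODEL_ID = "NousResearch/Llama-2-7b-hf"
ADAPTER_ID = "sijiasijia/lora-backward-1000"  # adapter repo id on the Hub


def build_backward_prompt(response: str) -> str:
    """Format a response y so the model continues with an instruction x.

    Assumption: mirrors the "### Output (y)" / "### Instruction (x)" layout
    described in the Dataset section of this card.
    """
    return f"### Output (y)\n{response.strip()}\n\n### Instruction (x)\n"


def generate_instruction(response: str, max_new_tokens: int = 64) -> str:
    """Load base model + adapter and generate a candidate instruction."""
    # Heavy imports stay local so the prompt helper works without them.
    import torch
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(BASE_MODEL_ID)
    base = AutoModelForCausalLM.from_pretrained(
        BASE_MODEL_ID,
        torch_dtype=torch.float16,
        device_map="auto",  # illustrative; requires the accelerate package
    )
    model = PeftModel.from_pretrained(base, ADAPTER_ID)
    model.eval()

    prompt = build_backward_prompt(response)
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    with torch.no_grad():
        output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Keep only the tokens generated after the prompt.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True).strip()
```

Calling `generate_instruction("Paris is the capital of France.")` would download both the base model and the adapter, so it is best run on a machine with a GPU and enough disk space.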

Dataset

Training data comes from timdettmers/openassistant-guanaco, from which pairs of the following form are extracted:

### Output (y)
<assistant's answer>

### Instruction (x)
<human's original question>
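The extraction step could look roughly like the sketch below. It assumes each dataset record has a `text` field whose turns are marked with `### Human:` and `### Assistant:`, and it keeps only the first turn of each conversation. The function names are hypothetical, and taking the first 1000 usable records is an assumption; the card does not say how the 1000 samples were selected.

```python
from typing import List, Optional, Tuple

HUMAN_TAG = "### Human:"
ASSISTANT_TAG = "### Assistant:"


def extract_first_pair(text: str) -> Optional[Tuple[str, str]]:
    """Return (instruction, response) for the first turn, or None."""
    h = text.find(HUMAN_TAG)
    if h == -1:
        return None
    a = text.find(ASSISTANT_TAG, h)
    if a == -1:
        return None
    instruction = text[h + len(HUMAN_TAG):a].strip()
    rest = text[a + len(ASSISTANT_TAG):]
    # The response ends where the next human turn begins, if there is one.
    end = rest.find(HUMAN_TAG)
    response = (rest if end == -1 else rest[:end]).strip()
    if not instruction or not response:
        return None
    return instruction, response


def to_backward_example(instruction: str, response: str) -> str:
    """Put the response first so the model learns to predict x from y."""
    return f"### Output (y)\n{response}\n\n### Instruction (x)\n{instruction}"


def build_backward_dataset(limit: int = 1000) -> List[str]:
    """Hypothetical driver: format the first `limit` usable records."""
    from datasets import load_dataset  # local import; downloads the dataset

    rows = load_dataset("timdettmers/openassistant-guanaco", split="train")
    examples: List[str] = []
    for row in rows:
        pair = extract_first_pair(row["text"])
        if pair is not None:
            examples.append(to_backward_example(*pair))
        if len(examples) >= limit:
            break
    return examples
```

Multi-turn conversations are truncated to their first exchange here; using later turns as additional pairs would be an equally reasonable choice.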









