LoRA Backward Model (1000 samples)

This model is a LoRA-finetuned version of NousResearch/Llama-2-7b-hf, trained to predict the instruction (x) given the assistant response (y). This implements the backward model training from the paper:

Self-Alignment with Instruction Backtranslation

Dataset

The training data comes from timdettmers/openassistant-guanaco, from which we extract pairs of the form:

### Output (y)
<assistant's answer>

### Instruction (x)
<human's original question>
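The pair extraction described above could be sketched as follows. This is a minimal illustration, not the script used to train this model: it assumes each guanaco record stores the conversation in a single string with `### Human:` and `### Assistant:` turn markers, and it keeps only the first exchange of each conversation. The helper names `extract_first_pair` and `format_backward_example` are hypothetical.

```python
import re

# Assumption: turns are marked with "### Human:" and "### Assistant:".
TURN_RE = re.compile(
    r"### (Human|Assistant):\s*(.*?)(?=\s*### (?:Human|Assistant):|\Z)",
    re.DOTALL,
)


def extract_first_pair(text):
    """Return (instruction, response) for the first Human/Assistant exchange, or None."""
    turns = [(m.group(1), m.group(2).strip()) for m in TURN_RE.finditer(text)]
    # Walk over adjacent turns and take the first Human turn followed by an Assistant turn.
    for (role_a, x), (role_b, y) in zip(turns, turns[1:]):
        if role_a == "Human" and role_b == "Assistant" and x and y:
            return x, y
    return None


def format_backward_example(instruction, response):
    """Build the backward-model text: response (y) first, then instruction (x)."""
    return f"### Output (y)\n{response}\n\n### Instruction (x)\n{instruction}"


if __name__ == "__main__":
    sample = (
        "### Human: What is LoRA?"
        "### Assistant: A low-rank adapter method."
        "### Human: Thanks"
    )
    pair = extract_first_pair(sample)
    if pair is not None:
        print(format_backward_example(*pair))
```

Putting the response before the instruction means the model learns to generate x conditioned on y, which is the backward direction used for instruction backtranslation.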
