---
license: apache-2.0
datasets:
  - PKU-Alignment/align-anything
base_model:
  - Qwen/Qwen2.5-0.5B-Instruct
---

This model is Qwen/Qwen2.5-0.5B-Instruct fine-tuned with DPO (Direct Preference Optimization) using the Align-Anything framework on the text-to-text data from the PKU-Alignment/align-anything dataset.
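
For readers unfamiliar with DPO, the following is a minimal sketch of the standard DPO objective for a single preference pair, written in plain Python. It is not taken from the Align-Anything code base: the function name `dpo_loss`, its argument names, and the default `beta=0.1` are illustrative assumptions, and the log-probabilities in the usage lines are made-up numbers.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Standard DPO loss for one (chosen, rejected) response pair.

    Each argument is the summed log-probability of a full response under
    either the policy being trained or the frozen reference model.
    `beta` is a hypothetical default, not a value from this model card.
    """
    # Implicit rewards: how far the policy has moved from the reference.
    chosen_reward = beta * (policy_chosen_logp - ref_chosen_logp)
    rejected_reward = beta * (policy_rejected_logp - ref_rejected_logp)
    margin = chosen_reward - rejected_reward

    # loss = -log(sigmoid(margin)) = log(1 + exp(-margin)),
    # computed in a numerically stable way for either sign of margin.
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


# Illustrative values only: the policy matches the reference exactly,
# then the policy favours the chosen response more than the reference does.
print(dpo_loss(-2.0, -2.0, -2.0, -2.0))
print(dpo_loss(-1.0, -5.0, -2.0, -2.0))
```

The loss shrinks as the policy assigns relatively more probability to the chosen response than the reference model does, which is the behaviour preference training aims for.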