CharlesLi/OpenELM-1_1B-DPO-full-max-8-reward
