Fix pipeline tag, add links, improve model card

#1
by nielsr HF staff - opened

This PR updates the model card with key information:

  • Sets the pipeline tag to reinforcement-learning.
  • Links to the paper and the associated code repository.
  • Adds the base model and evaluation metrics to the metadata.
  • Adds dataset information to the metadata.
xwm changed pull request status to merged

Sign up or log in to comment