This repository contains the trained checkpoints corresponding to our paper RepoFusion: Training Code Models to Understand Your Repository. The released checkpoints are:

RepoFusion_PPC: RepoFusion model trained with prompt proposal repo contexts. This is our best-performing model.
RepoFusion_BM25: RepoFusion model trained with BM25 repo contexts.
RepoFusion_RandomNN: RepoFusion model trained with RandomNN repo contexts.
finetuned_codet5base_512: Our finetuned CodeT5-base model. This was used as initialization for our RepoFusion models.
finetuned_codet5large_512: Our finetuned CodeT5-large model. This was used as a baseline.

For details of how these models were trained and evaluated, please check our paper RepoFusion: Training Code Models to Understand Your Repository.

Citation

@article{shrivastava2023repofusion,
  title={RepoFusion: Training Code Models to Understand Your Repository},
  author={Shrivastava, Disha and Kocetkov, Denis and de Vries, Harm and Bahdanau, Dzmitry and Scholak, Torsten},
  journal={arXiv preprint arXiv:2306.10998},
  year={2023}
}

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Paper for RepoFusion/trained_checkpoints

RepoFusion: Training Code Models to Understand Your Repository

Paper • 2306.10998 • Published Jun 19, 2023 • 13