flax-community
/

code-mt5-base

Text2Text Generation

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

Edit model card

YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

Tokenizer

We trained our tokenizer using sentencepiece's unigram tokenizer. Then loaded the tokenizer as MT5TokenizerFast.

Model

We used MT5-base model.

Datasets

We used Code Search Net's dataset and some scrapped data from internet to train the model. We maintained a list of datasets where each dataset had codes of same language.

Plots

Train loss

Evaluation loss

Evaluation accuracy

Learning rate

Fine tuning (WIP)

We fine tuned the model with CodeXGLUE code-to-code-trans dataset, and scrapper data.

Downloads last month: 18

Safetensors

Model size

241M params

Tensor type

F32

·

Inference Examples

Text2Text Generation

This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.