---
license: mit
datasets:
  - DarwinAnim8or/grug
language:
  - en
pipeline_tag: text-generation
tags:
  - grug
  - caveman
  - fun
---

# GPT-Grug-125m

A fine-tuned version of EleutherAI's GPT-Neo-125M on the 'grug' dataset.

## Training Procedure

This model was trained on the 'grug' dataset using the Happy Transformer library on Google Colab, for 4 epochs with a learning rate of 1e-2.
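The setup above can be sketched with Happy Transformer's generation API. This is a minimal sketch, not the exact Colab notebook: the training file name `grug.txt` and the output directory are assumptions, while the base model, epoch count, and learning rate come from this card.

```python
EPOCHS = 4            # stated in this card
LEARNING_RATE = 1e-2  # stated in this card

if __name__ == "__main__":
    # pip install happytransformer
    from happytransformer import HappyGeneration, GENTrainArgs

    # Start from the base model this card fine-tunes.
    happy_gen = HappyGeneration("GPT-NEO", "EleutherAI/gpt-neo-125M")

    args = GENTrainArgs(num_train_epochs=EPOCHS, learning_rate=LEARNING_RATE)
    happy_gen.train("grug.txt", args=args)  # plain-text grug dataset (assumed file name)
    happy_gen.save("gpt-grug-125m/")        # assumed output path
```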

## Biases & Limitations

This model likely inherits the biases and limitations of the original GPT-Neo-125M it is based on, along with heavy additional biases from the 'grug' dataset.

## Intended Use

This model is meant for fun; please do not take anything this caveman says seriously.
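For playing with the model, a standard `transformers` text-generation pipeline works. This is a sketch: the repo id below is an assumption based on this card's title, so adjust it if the model is published under a different name.

```python
# Assumed repo id, inferred from this card's title -- not confirmed by the card itself.
MODEL_ID = "DarwinAnim8or/gpt-grug-125m"

if __name__ == "__main__":
    # pip install transformers torch
    from transformers import pipeline

    grug = pipeline("text-generation", model=MODEL_ID)
    out = grug("grug see big rock.", max_new_tokens=40, do_sample=True)
    print(out[0]["generated_text"])
```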