Ballpark-Trivia-XL / README.md
pszemraj's picture
Create README.md
6eac879
|
raw
history blame
1.14 kB

Ballpark Trivia: Size XL

Are you frequently asked google-able Trivia questions and annoyed by it? Well, this is the model for you! Ballpark Trivia Bot answers any trivia question with something that sounds plausible but is probably not 100% correct. It should be more credible or at least able to handle more kinds of questions

Training

This text gen model is a GPT-2 ~1.5 B Parameter Size XL Model, first trained on Wizard of Wikipedia for 40k steps (33/36 layers frozen for the fine-tuning), and then subsequently trained for 40k steps on a parsed variant of Natural Questions(then 34/36 layers frozen for the second fine-tuning) to accidentally create this model.

Note that because the model was originally trained for use in a chatbot application, it uses a named conversation dialogue structure, i.e. the questions are asked by person alpha, and responded to by person beta. Even if you don't specify person alpha in the prompt, it hopefully responds to any question.