MarioBarbeque's Collections

Finetuning

updated Jan 27

Models to fine-tune, and datasets to fine-tune them with, in future projects


  • nvidia/Llama-3.1-Nemotron-70B-Instruct-HF

    Text Generation • 71B • Updated Apr 13 • 48.8k • 2.05k

  • FacebookAI/roberta-base

    Fill-Mask • 0.1B • Updated Feb 19, 2024 • 6.38M • 503

  • openai-community/gpt2

    Text Generation • 0.1B • Updated Feb 19, 2024 • 15.2M • 2.81k

  • databricks/dbrx-instruct

    Text Generation • 132B • Updated Apr 19, 2024 • 8.85k • 1.12k

  • google/gemma-2-9b

    Text Generation • 9B • Updated Aug 7, 2024 • 89k • 659

  • google/gemma-2-2b

    Text Generation • 3B • Updated Aug 7, 2024 • 186k • 562

  • google/gemma-2-2b-it

    Text Generation • 3B • Updated Aug 27, 2024 • 303k • 1.12k

  • google/gemma-1.1-2b-it

    Text Generation • 3B • Updated Jun 27, 2024 • 99.8k • 162

  • nvidia/HelpSteer2

    Viewer • Updated Dec 18, 2024 • 21.4k • 2k • 419

  • HuggingFaceH4/no_robots

    Viewer • Updated Apr 18, 2024 • 10k • 1.71k • 483

  • cais/mmlu

    Viewer • Updated Mar 8, 2024 • 231k • 164k • 496

  • EleutherAI/gpt-j-6b

    Text Generation • Updated Jun 21, 2023 • 245k • 1.5k

  • google/flan-t5-large

    Text2Text Generation • 0.8B • Updated Jul 17, 2023 • 490k • 783

  • deepseek-ai/DeepSeek-R1

    Text Generation • 685B • Updated Mar 27 • 602k • 12.4k