Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
estnafinema0 's Collections
PEFT variations
NER Extraction. Active Learning Approach.
SmolLM Variation: PPO & DPO Fine-Tuning for RLHF

PEFT variations

updated Apr 11
Upvote
-

  • estnafinema0/llm-course-hw3-dora

    Text Generation • Updated Apr 11 • 10

  • estnafinema0/llm-course-hw3-lora

    Text Generation • Updated Apr 11 • 3

  • estnafinema0/llm-course-hw3-tinyllama-qlora

    Updated Apr 11

  • estnafinema0/llm-course-hw3-tinyllamma-qlora

    Updated Apr 11
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs