kd303's Collections

Fine-tuning

updated Apr 8

  • Extending Llama-3's Context Ten-Fold Overnight

    Paper • 2404.19553 • Published Apr 30, 2024 • 35

  • ReFT: Representation Finetuning for Language Models

    Paper • 2404.03592 • Published Apr 4, 2024 • 99

  • Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck

    Paper • 2404.07647 • Published Apr 11, 2024 • 4

  • SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning

    Paper • 2401.07950 • Published Jan 15, 2024 • 4

  • Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models

    Paper • 2312.06585 • Published Dec 11, 2023 • 29

  • OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning

    Paper • 2412.16849 • Published Dec 22, 2024 • 9

  • laion/OIG

    Viewer • Updated Mar 31, 2023 • 52.6M • 11.1k • 303