Testerpce's Collections

Fine tuning

updated Jan 28

  • When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method

    Paper • 2402.17193 • Published Feb 27, 2024 • 26

  • What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective

    Paper • 2410.23743 • Published Oct 31, 2024 • 64

  • Direct Preference Optimization Using Sparse Feature-Level Constraints

    Paper • 2411.07618 • Published Nov 12, 2024 • 16

  • Transformer^2: Self-adaptive LLMs

    Paper • 2501.06252 • Published Jan 9 • 55

  • Control LLM: Controlled Evolution for Intelligence Retention in LLM

    Paper • 2501.10979 • Published Jan 19 • 6