cooleel's Collections

LLMs

updated Mar 13

  • Self-Boosting Large Language Models with Synthetic Preference Data

    Paper • 2410.06961 • Published Oct 9, 2024 • 17

  • Qwen2.5 Technical Report

    Paper • 2412.15115 • Published Dec 19, 2024 • 368

  • SCOPE: Optimizing Key-Value Cache Compression in Long-context Generation

    Paper • 2412.13649 • Published Dec 18, 2024 • 20

  • NeoBERT: A Next-Generation BERT

    Paper • 2502.19587 • Published Feb 26 • 39

  • Think Inside the JSON: Reinforcement Strategy for Strict LLM Schema Adherence

    Paper • 2502.14905 • Published Feb 18 • 9

  • How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?

    Paper • 2502.14502 • Published Feb 20 • 91

  • From RAG to Memory: Non-Parametric Continual Learning for Large Language Models

    Paper • 2502.14802 • Published Feb 20 • 13

  • LLM Pretraining with Continuous Concepts

    Paper • 2502.08524 • Published Feb 12 • 29

  • MMTEB: Massive Multilingual Text Embedding Benchmark

    Paper • 2502.13595 • Published Feb 19 • 36

  • Large-Scale Data Selection for Instruction Tuning

    Paper • 2503.01807 • Published Mar 3 • 12

  • Fine-Tuning Small Language Models for Domain-Specific AI: An Edge AI Perspective

    Paper • 2503.01933 • Published Mar 3 • 12