sdy1130's Collections
LLM

updated Jan 27

  • Towards a Unified View of Preference Learning for Large Language Models: A Survey

    Paper • 2409.02795 • Published Sep 4, 2024 • 74

  • MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct

    Paper • 2409.05840 • Published Sep 9, 2024 • 49

  • OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs

    Paper • 2409.05152 • Published Sep 8, 2024 • 33

  • Training Language Models to Self-Correct via Reinforcement Learning

    Paper • 2409.12917 • Published Sep 19, 2024 • 140

  • LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations

    Paper • 2410.02707 • Published Oct 3, 2024 • 49

  • LongGenBench: Long-context Generation Benchmark

    Paper • 2410.04199 • Published Oct 5, 2024 • 22

  • Aria: An Open Multimodal Native Mixture-of-Experts Model

    Paper • 2410.05993 • Published Oct 8, 2024 • 112

  • Humanity's Last Exam

    Paper • 2501.14249 • Published Jan 24 • 76

  • DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

    Paper • 2501.12948 • Published Jan 22 • 406