Julius-L's Collections

LLM Technical Reports

updated Nov 4, 2024

  • The Llama 3 Herd of Models

    Paper • 2407.21783 • Published Jul 31, 2024 • 117

  • Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

    Paper • 2409.12191 • Published Sep 18, 2024 • 78

  • Baichuan Alignment Technical Report

    Paper • 2410.14940 • Published Oct 19, 2024 • 52

  • A Survey of Small Language Models

    Paper • 2410.20011 • Published Oct 25, 2024 • 44