Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Nazzaroth2 's Collections
Reward Modeling
models to test out
data synthesis
RL_Papers in general
OCR
imageGen
VLM RL Reasoning
LLM-External_information
llm_compression
LLM_Reasoning-ErrorCorrection
Loras
3D (nerfs, gaussians, generation etc.)
t2i consistency works
videogames_roleplay
small_or_multimodal_llm
manga_translation
long_context
model training

llm_compression

updated Mar 27, 2024
Upvote
-

  • BitNet: Scaling 1-bit Transformers for Large Language Models

    Paper • 2310.11453 • Published Oct 17, 2023 • 103

  • Learning From Mistakes Makes LLM Better Reasoner

    Paper • 2310.20689 • Published Oct 31, 2023 • 29

  • The Unreasonable Ineffectiveness of the Deeper Layers

    Paper • 2403.17887 • Published Mar 26, 2024 • 81
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs