Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Ingyu Seong's picture

Ingyu Seong

ingyu
  • https://github.com/ingyuseong

AI & ML interests

None yet

Organizations

None yet

Collections 4

Inference Optimization
  • The Impact of Hyperparameters on Large Language Model Inference Performance: An Evaluation of vLLM and HuggingFace Pipelines

    Paper • 2408.01050 • Published Aug 2, 2024 • 9
  • Efficient Inference of Vision Instruction-Following Models with Elastic Cache

    Paper • 2407.18121 • Published Jul 25, 2024 • 17
  • LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference

    Paper • 2407.14057 • Published Jul 19, 2024 • 46
  • Q-Sparse: All Large Language Models can be Fully Sparsely-Activated

    Paper • 2407.10969 • Published Jul 15, 2024 • 23
Model Compression
  • Compact Language Models via Pruning and Knowledge Distillation

    Paper • 2407.14679 • Published Jul 19, 2024 • 40

spaces 1

Runtime error

KLUE MRC

🤗

Jul 4, 2023

models 0

None public yet

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs