Model Card for Model ID

The 13B model of "SDSAT: Accelerating LLM Inference through Speculative Decoding with Semantic Adaptive Tokens"

Model Details

Model Description

  • Developed by: ainergy
  • Language(s) (NLP): Code
  • Finetuned from model: CodeLlama-13B

Model Sources

Evaluation

Results

image/png

image/png

Walltime improvement

image/png

Downloads last month
5
Safetensors
Model size
13B params
Tensor type
BF16
·
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for ainergy/CodeLlama-SDSAT_L7_13B

Quantizations
1 model