Model Card for CodeLlama-SDSAT_L7_13B

The 13B model from the paper "SDSAT: Accelerating LLM Inference through Speculative Decoding with Semantic Adaptive Tokens".

Model Details

Model Description

Developed by: ainergy
Language(s) (NLP): Code
Finetuned from model: CodeLlama-13B

Model Sources

Repository: https://github.com/ainergy-ml/SDSAT
Paper: https://arxiv.org/abs/2403.18647
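A minimal loading sketch, assuming the checkpoint is published under the ID ainergy/CodeLlama-SDSAT_L7_13B and loads with the standard Hugging Face transformers classes, as its CodeLlama-13B base does. This is not taken from the SDSAT repository: the helper names `load_model` and `complete` are hypothetical, and plain `generate` as used here performs ordinary autoregressive decoding. The speculative decoding speedup described in the paper presumably requires the inference code from the repository above.

```python
# Hypothetical loading sketch; not taken from the SDSAT repository.
# Assumes the transformers, torch, and accelerate packages are installed.
MODEL_ID = "ainergy/CodeLlama-SDSAT_L7_13B"


def load_model(model_id: str = MODEL_ID):
    """Load the tokenizer and model weights in BF16."""
    # Imports are deferred so this module can be imported without the heavy deps.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        torch_dtype=torch.bfloat16,  # BF16 is the tensor type listed for this model
        device_map="auto",           # needs accelerate; spreads layers over available devices
    )
    return tokenizer, model


def complete(prompt: str, max_new_tokens: int = 64) -> str:
    """Greedy completion with standard decoding (no SDSAT acceleration)."""
    tokenizer, model = load_model()
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens, do_sample=False)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `complete("def fibonacci(n):")` would download the full 13B checkpoint on first use, so it is best tried on a machine with enough GPU memory for BF16 weights.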

Evaluation

Results

Walltime improvement

Safetensors

Model size: 13B params
Tensor type: BF16

Model ID: ainergy/CodeLlama-SDSAT_L7_13B