Llama-3.1-8B-Instruct-speculator.eagle3

Model Overview

  • Verifier: meta-llama/Llama-3.1-8B-Instruct
  • Speculative Decoding Algorithm: EAGLE-3
  • Model Architecture: Eagle3Speculator
  • Release Date: 07/27/2025
  • Version: 1.0
  • Model Developers: RedHat

This is a speculator model designed for use with meta-llama/Llama-3.1-8B-Instruct, based on the EAGLE-3 speculative decoding algorithm. It was trained using the speculators library on a combination of the Aeala/ShareGPT_Vicuna_unfiltered and the HuggingFaceH4/ultrachat_200k datasets.

Downloads last month
1,480
Safetensors
Model size
950M params
Tensor type
I64
F32
BOOL
Inference Providers NEW
This model isn't deployed by any Inference Provider. 馃檵 Ask for provider support