Back to Models
RedHatAI logo

RedHatAI/gemma-4-31B-it-speculator.eagle3

RedHatAIgeneral

gemma4-31B-it-speculator.eagle3

This is a preliminary model release, we will continue to train the model and improve the acceptance rates in the next few days.

Model Overview

  • Verifier: google/gemma-4-31b-it
  • Speculative Decoding Algorithm: EAGLE-3
  • Model Architecture: Eagle3Speculator
  • Release Date: 04/09/2026
  • Version: 1.0
  • Model Developers: RedHat

This is a speculator model designed for use with google/gemma-4-31b-it, based on the EAGLE-3 speculative decoding algorithm. It was trained using the Speculators library on a combination of the Magpie-Align/Magpie-Llama-3.1-Pro-300K-Filtered dataset and the train_sft split of the HuggingFaceH4/ultrachat_200k dataset. Training data used Magpie + UltraChat with responses from the gemma-4-31B-it model (no reasoning). This model should be used with the google/gemma-4-31b-it chat template, specifically through the /chat/completions endpoint.

vLLM version

UPDATE: Now supported on vllm-main!

Use with vLLM

vllm serve google/gemma-4-31b-it \
  --tensor-parallel-size 2 \
  --speculative-config '{
    "model": "RedHatAI/gemma-4-31B-it-speculator.eagle3",
    "num_speculative_tokens": 3,
    "method": "eagle3"
  }' \
  --max-num-seqs 64 \

Evaluations

Model / run:
vLLM: UPDATE: Now supported on vllm-main!

Training data: Magpie + UltraChat; responses from the gemma 4 31B it model (no reasoning).

Use cases

Use CaseDatasetNumber of Samples
CodingHumanEval164
Math Reasoningmath_reasoning80
Question Answeringqa80
MT_bench (Question)question80
RAGrag80
Summarizationsummarization80
Translationtranslation80

Acceptance lengths (draft length, temperature=default)

Datasetk=1k=2k=3k=4k=5
HumanEval1.862.553.103.503.80
math_reasoning1.872.593.153.593.93
qa1.642.012.222.342.38
question1.732.212.532.712.83
rag1.722.212.502.652.80
summarization1.601.922.072.152.20
translation1.692.132.412.572.68
Visit Website

0 reviews

5
0
4
0
3
0
2
0
1
0
Likes45
Downloads
📝

No reviews yet

Be the first to review RedHatAI/gemma-4-31B-it-speculator.eagle3!

Model Info

ProviderRedHatAI
Categorygeneral
Reviews0
Avg. Rating / 5.0

Community

Likes45
Downloads

Rating Guidelines

★★★★★Exceptional
★★★★Great
★★★Good
★★Fair
Poor