Back to Models
RedHatAI logo

RedHatAI/gemma-4-26B-A4B-it-NVFP4

RedHatAIgeneral

This is a preliminary version (and subject to change) of the NVFP4 quantized google/gemma-4-26B-A4B-it model. The model has both weights and activations quantized to NVFP4 with vllm-project/llm-compressor.

Run it with vllm serve RedHatAI/gemma-4-26B-A4B-it-NVFP4 --max-model-len 96000 on vllm nightly as this PR is needed https://github.com/vllm-project/vllm/pull/39045.

On a single B200:

lm_eval   --model local-chat-completions   --tasks gsm8k_platinum_cot_llama   --model_args "model=RedHatAI/gemma-4-26B-A4B-it-NVFP4,max_length=96000,base_url=http://0.0.0.0:8000/v1/chat/completions,num_concurrent=128,max_retries=3,tokenized_requests=False,tokenizer_backend=None,timeout=2400"   --num_fewshot 5   --apply_chat_template   --fewshot_as_multiturn   --output_path results_gsm8k_platinum.json   --seed 1234   --gen_kwargs "do_sample=True,temperature=1.0,top_p=0.95,top_k=64,max_gen_toks=64000,seed=1234"

Original:

|         Tasks          |Version|     Filter     |n-shot|  Metric   |   |Value |   |Stderr|
|------------------------|------:|----------------|-----:|-----------|---|-----:|---|-----:|
|gsm8k_platinum_cot_llama|      3|flexible-extract|     5|exact_match|↑  |0.9702|±  |0.0049|
|                        |       |strict-match    |     5|exact_match|↑  |0.9702|±  |0.0049|

NVFP4:

|         Tasks          |Version|     Filter     |n-shot|  Metric   |   |Value |   |Stderr|
|------------------------|------:|----------------|-----:|-----------|---|-----:|---|-----:|
|gsm8k_platinum_cot_llama|      3|flexible-extract|     5|exact_match|↑  |0.9562|±  |0.0059|
|                        |       |strict-match    |     5|exact_match|↑  |0.9562|±  |0.0059|
Visit Website

0 reviews

5
0
4
0
3
0
2
0
1
0
Likes25
Downloads
📝

No reviews yet

Be the first to review RedHatAI/gemma-4-26B-A4B-it-NVFP4!

Model Info

ProviderRedHatAI
Categorygeneral
Reviews0
Avg. Rating / 5.0

Community

Likes25
Downloads

Rating Guidelines

★★★★★Exceptional
★★★★Great
★★★Good
★★Fair
Poor