kaitchup/Qwen3.6-27B-autoround-nvfp4-linearattn-BF16
This is Qwen/Qwen3.6-27B quantized with AutoRound to NVFP4, with the linear-attention layers kept in 16-bit (BF16). The model is compatible with vLLM (tested with v0.19) and was tested on an RTX Pro 6000. Evaluation is still in progress; a similar quantization of Qwen3.5 performed very well.
- Developed by: The Kaitchup
Instructions
uv pip install vllm
uv pip install git+https://github.com/huggingface/transformers.git
vllm serve kaitchup/Qwen3.6-27B-autoround-nvfp4-linearattn-BF16 --max-model-len 262144 --reasoning-parser qwen3
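Once the server is running, vLLM exposes an OpenAI-compatible API (by default on `http://localhost:8000/v1`). A minimal sketch of a chat-completion request against that endpoint, assuming the default host and port:

```python
import json
import urllib.request

# Assumed default vLLM endpoint; adjust host/port if you changed the serve command.
BASE_URL = "http://localhost:8000/v1/chat/completions"

payload = {
    "model": "kaitchup/Qwen3.6-27B-autoround-nvfp4-linearattn-BF16",
    "messages": [
        {"role": "user", "content": "Summarize NVFP4 quantization in one sentence."}
    ],
    "max_tokens": 256,
}

# Serialize the request body as the server expects it.
body = json.dumps(payload).encode("utf-8")

def send_request(url: str = BASE_URL) -> dict:
    """Post the chat request and return the parsed JSON response (requires a running server)."""
    req = urllib.request.Request(
        url, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())

# With the server up, the reply text is at:
# send_request()["choices"][0]["message"]["content"]
```

The same endpoint also works with the official `openai` Python client by pointing `base_url` at the server and passing any string as the API key.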