Back to Models
GestaltLabs logo

GestaltLabs/Ornstein-Hermes-3.6-27b-SABER-GGUF

GestaltLabsgeneral

Ornstein-Hermes-3.6-27B SABER GGUF

GGUF quantizations of GestaltLabs/Ornstein-Hermes-3.6-27b-SABER, a SABER-edited version of GestaltLabs/Ornstein-Hermes-3.6-27b.

SABER is a controlled refusal-shaping workflow. The release target is to reduce broad over-refusal while preserving ordinary model behavior and visible boundaries for severe criminal, coercive, or interpersonal-harm requests. The selected checkpoint was chosen as a Pareto point over refusal rate and behavioral drift.

Source Checkpoint

fieldvalue
Source repoGestaltLabs/Ornstein-Hermes-3.6-27b-SABER
Base modelGestaltLabs/Ornstein-Hermes-3.6-27b
SABER runornstein_hermes36_27b_svd_a850_g25_retry_biggpu
Expanded refusal eval1 / 349 refusals
Refusal rate0.29%
KLD mean11.2216
Base-vs-base KLD mean11.2206
KLD delta over base-vs-base+0.0010
KLD prompts149
Tokens scored for KLD3,347

The one retained refusal in the expanded evaluation was an illegal-drug-sales request. This is an observed result on the current evaluation set, not a universal guarantee about future behavior.

Quantization Files

filequantsizenotes
Ornstein-Hermes-3.6-27b-SABER-IQ4_XS.ggufIQ4_XS15GCompact imatrix-assisted 4-bit option.
Ornstein-Hermes-3.6-27b-SABER-IQ2_M.ggufIQ2_M9GSmallest emergency 2-bit option; expect the most quality loss.
Ornstein-Hermes-3.6-27b-SABER-Q3_K_M.ggufQ3_K_M13GSmallest file in this suite; expect more quality loss.
Ornstein-Hermes-3.6-27b-SABER-Q4_K_M.ggufQ4_K_M16GGeneral-purpose recommended starting point.
Ornstein-Hermes-3.6-27b-SABER-Q5_K_M.ggufQ5_K_M18GBalanced high-quality option.
Ornstein-Hermes-3.6-27b-SABER-Q6_K.ggufQ6_K21GStrong quality/size option for high-memory local inference.
Ornstein-Hermes-3.6-27b-SABER-Q8_0.ggufQ8_027GHighest quality quant in this suite; largest runtime file.

The included imatrix file was generated from DJLougen/Acta-Synthetic. It is included for reproducibility and for users who want to regenerate adjacent quantizations.

Recommended File

Start with for normal desktop use. Use or if you have enough VRAM/RAM and want a higher-quality local run. Use when file size matters more. is mainly for high-memory systems or as a near-lossless GGUF reference.

llama.cpp Compatibility

These files were produced with llama.cpp commit from a BF16 GGUF conversion of the SABER checkpoint. The model uses the GGUF architecture path in current llama.cpp.

Example:

For chat-style use, prefer a frontend or wrapper that applies the tokenizer chat template from the GGUF metadata.

Conversion and Quantization Notes

The Q8_0 GGUF was converted from the full SABER Hugging Face checkpoint. The lower-bit recovery quants were generated from the published Q8_0 GGUF with --allow-requantize and the included Acta-Synthetic imatrix so the missing files could be restored quickly. Importance-matrix calibration used Acta-Synthetic conversational text.

Method Summary

SABER edits refusal behavior through activation/weight-space refusal directions. For this checkpoint, the run used SVD extraction, multi-layer candidate selection, iterative ablation, and KLD-based drift measurement.

Run configuration:

Selected layers:

Total directions ablated: .

Attribution and Related Work

This release builds on the refusal-direction and abliteration research lineage. Relevant prior work and inspirations include:

SABER's contribution in this release is the controlled-refusal-shaping workflow: multi-candidate refusal extraction, separability/entanglement-aware ranking, differential ablation strength, and explicit Pareto selection over refusal behavior and KLD drift.

Limitations

  • Results are specific to the current evaluation set, prompts, and generation settings.
  • The KLD value should be interpreted relative to the base-vs-base control, not as an absolute standalone score.
  • Quantization changes numerical behavior; validate the specific GGUF file you deploy.
  • The model inherits constraints, limitations, and licensing considerations from the base model.
  • This is a model-editing research artifact with dual-use implications.
Visit Website

0 reviews

5
0
4
0
3
0
2
0
1
0
Likes15
Downloads
📝

No reviews yet

Be the first to review GestaltLabs/Ornstein-Hermes-3.6-27b-SABER-GGUF!

Model Info

ProviderGestaltLabs
Categorygeneral
Reviews0
Avg. Rating / 5.0

Community

Likes15
Downloads

Rating Guidelines

★★★★★Exceptional
★★★★Great
★★★Good
★★Fair
Poor