Abliterate

verb · Rare · Advanced level

Definitions

Verb
  1. (neologism) To uncensor a large language model by modifying specific model internals to remove refusal behaviours or unwanted traits, while aiming to preserve the model's other capabilities.

    "Now that we have our datasets, we can load the model we want to abliterate. […] I evaluated the abliterated and source models from the previous section on the Open LLM Leaderboard and on Nous' benchmark suite."

Etymology

Blend of ablate + obliterate. Coined by Redditor /u/FailSpai in early 2024; the idea is to ablate refusal features to the point of obliteration.

Data sourced from Wiktionary, WordNet, CMU, and other open linguistic databases. Updated March 2026.