Explanations
words related to censorship or control of information
instances of the word "suppress" and its variations in contexts related to censorship or control
Negative Logits
giving     -0.75
replace    -0.74
psc        -0.71
ser        -0.71
Äį         -0.70
zag        -0.69
aldo       -0.69
gow        -0.68
hd         -0.68
deals      -0.66
Positive Logits
ively         0.96
impulses      0.84
emotions      0.83
dissent       0.77
feelings      0.77
laughter      0.75
é¾            0.73
tears         0.73
=#            0.73
distractions  0.71
Activations Density 0.073%