INDEX
Explanations
terms related to censorship, suppression, and control of information
terms related to oppression and suppression of individuals or groups
New Auto-Interp
Negative Logits
coni
-0.79
rapnel
-0.78
oof
-0.77
poral
-0.73
teasp
-0.72
ukong
-0.71
gars
-0.71
ructose
-0.70
cin
-0.70
coon
-0.70
POSITIVE LOGITS
legitimate
1.05
whistleblowers
1.04
innocent
1.02
inconvenient
1.01
lawful
0.96
freedoms
0.95
democratic
0.94
dissent
0.92
genuine
0.92
democratically
0.91
Activations Density 0.380%