INDEX
Explanations
words related to suppression or control
terms associated with control and suppression
New Auto-Interp
Negative Logits
ittal
-0.91
çķ
-0.72
人
-0.69
uben
-0.68
format
-0.66
olate
-0.65
ortality
-0.64
Equipment
-0.63
oeuv
-0.63
olds
-0.62
POSITIVE LOGITS
dissent
1.32
dissenting
1.08
criticism
0.99
rumours
0.98
rumors
0.97
criticisms
0.94
rivals
0.92
critics
0.92
rebell
0.91
doubts
0.90
Activations Density 0.230%