INDEX
Explanations
phrases related to blame or accusations
phrases indicating authorship or responsibility in a context of political or social actions
New Auto-Interp
Negative Logits
borgh
-0.81
orem
-0.76
aukee
-0.76
imity
-0.74
daq
-0.71
vari
-0.70
ãĤ¡
-0.69
yssey
-0.69
answer
-0.68
hene
-0.68
POSITIVE LOGITS
politicians
1.33
bureaucrats
1.19
extremists
1.13
policymakers
1.11
elites
1.07
leftists
1.05
governments
1.05
successive
1.04
perpetrators
1.03
individuals
1.02
Activations Density 0.214%