INDEX
Explanations
phrases related to criticism and blame directed at authorities
assertive statements regarding authority and accountability
New Auto-Interp
Negative Logits
requent
-0.79
VERTISEMENT
-0.70
aughtered
-0.69
Moines
-0.68
acters
-0.68
avascript
-0.67
icipated
-0.67
Scroll
-0.67
izable
-0.67
estyles
-0.66
POSITIVE LOGITS
complicit
1.45
ignoring
1.39
conspiring
1.38
hypoc
1.35
abusing
1.32
waging
1.30
undermining
1.27
refusing
1.26
denying
1.26
behaving
1.25
Activations Density 0.217%