INDEX
Explanations
phrases related to political discussions and societal issues
statements related to public accountability and political scrutiny
Negative Logits
foreseen    -0.66
estones     -0.64
contact     -0.63
wearer      -0.61
CV          -0.60
Finder      -0.59
PW          -0.59
ヴァ        -0.59
ounter      -0.58
ミ          -0.57
Positive Logits
hypocrisy       1.25
arrogance       1.15
irresponsible   1.10
neglect         1.08
incompetence    1.06
hypocritical    1.06
ignorance       1.03
stupidity       1.02
misguided       1.01
ignorant        1.01
Activations Density 2.143%