INDEX
Explanations
instances of negative events or controversial actions related to individuals in various roles or positions
instances of post-resignation events or consequences
New Auto-Interp
Negative Logits
soDeliveryDate
-0.75
iour
-0.73
toget
-0.70
kings
-0.69
ipeg
-0.68
marqu
-0.65
Purg
-0.65
enfranch
-0.64
rieve
-0.64
inct
-0.62
POSITIVE LOGITS
allegations
1.16
allegedly
1.09
mishand
1.07
violating
1.05
misconduct
1.05
breaching
1.04
violations
1.03
alleged
1.02
fals
0.98
accusations
0.98
Activations Density 0.205%