INDEX
Explanations
references to individuals and their actions in a political context
New Auto-Interp
Head Attr Weights
0:0.02
1:0.01
2:0.06
3:0.12
4:0.08
5:0.03
6:0.05
7:0.15
8:0.17
9:0.05
10:0.12
11:0.08
Negative Logits
chemy
-1.59
metic
-1.43
vity
-1.37
bringer
-1.35
rup
-1.32
edit
-1.31
rises
-1.30
imore
-1.29
Shap
-1.25
GOODMAN
-1.25
POSITIVE LOGITS
ESA
1.31
ento
1.31
suspended
1.23
``
1.23
susp
1.22
retiring
1.20
wcs
1.19
jong
1.18
entitlement
1.17
TAMADRA
1.17
Activations Density 0.034%