INDEX
Explanations
references to prominent political figures and their actions or statements
New Auto-Interp
Head Attr Weights
0:0.02
1:0.01
2:0.14
3:0.07
4:0.04
5:0.04
6:0.05
7:0.03
8:0.04
9:0.05
10:0.20
11:0.25
Negative Logits
repeal
-1.70
colonization
-1.55
referendum
-1.51
govern
-1.49
annexation
-1.47
PayPal
-1.43
reneg
-1.42
mandate
-1.42
moratorium
-1.37
amen
-1.36
POSITIVE LOGITS
��
1.68
utenberg
1.66
ulton
1.64
ographed
1.64
olphin
1.59
ograph
1.49
abulary
1.49
icultural
1.48
Lauder
1.47
HAEL
1.45
Activations Density 0.035%