INDEX
Explanations
phrases related to political negotiations and strategies
New Auto-Interp
Head Attr Weights
0:0.04
1:0.08
2:0.10
3:0.03
4:0.05
5:0.09
6:0.09
7:0.08
8:0.08
9:0.11
10:0.13
11:0.07
Negative Logits
initely
-1.00
osite
-0.99
ItemThumbnailImage
-0.99
storage
-0.97
inar
-0.96
uilt
-0.96
Mystery
-0.95
icro
-0.94
paired
-0.92
idated
-0.91
POSITIVE LOGITS
advers
1.26
scapego
1.05
empires
1.04
reforms
1.03
behavi
1.02
centr
1.02
undermining
1.01
vend
0.99
Democr
0.99
doms
0.98
Activations Density 0.772%