INDEX
Explanations
references to political sanctions
New Auto-Interp
Head Attr Weights
0:0.27
1:0.01
2:0.03
3:0.22
4:0.02
5:0.05
6:0.05
7:0.03
8:0.04
9:0.10
10:0.10
11:0.03
Negative Logits
glas
-2.51
Lumpur
-2.40
Ara
-2.20
osa
-2.18
icago
-2.13
INAL
-2.13
Saf
-2.07
ouston
-2.05
Saras
-2.04
Palo
-2.02
POSITIVE LOGITS
tit
3.59
tug
2.75
chet
2.35
raints
2.31
Tit
2.23
tv
2.22
TV
2.20
chars
2.17
tabl
2.16
mantle
2.16
Activations Density 0.000%