INDEX
Explanations
phrases related to political events and relationships between countries
New Auto-Interp
Head Attr Weights
0:0.06
1:0.03
2:0.07
3:0.04
4:0.07
5:0.04
6:0.23
7:0.06
8:0.08
9:0.20
10:0.02
11:0.05
Negative Logits
Ford
-4.05
Ford
-3.73
charg
-3.69
lantern
-3.59
Hay
-3.43
Hud
-3.38
hous
-3.24
McKenzie
-3.21
comet
-3.19
emit
-3.13
POSITIVE LOGITS
Serbia
8.60
Serbian
8.01
Yugoslavia
6.14
Kosovo
6.12
Alban
6.04
Balkans
5.92
Macedonia
5.84
Albania
5.53
Croatia
5.51
Yugoslav
5.48
Activations Density 0.005%