Explanations
mentions of countries and their political relations or actions
Negative Logits
ITH       -0.18
chw       -0.16
xmm       -0.14
inz       -0.14
IFY       -0.14
eer       -0.14
mith      -0.14
STS       -0.13
ussian    -0.13
นวน       -0.13
Positive Logits
Gilbert   0.17
preneur   0.16
buurt     0.16
hlen      0.15
FM        0.15
ager      0.14
venue     0.14
achel     0.14
past      0.14
Kan       0.14
Activations Density 0.076%