INDEX
Explanations
European countries and political entities
references to countries and their interactions or relationships in a geopolitical context
New Auto-Interp
Negative Logits
uces
-0.78
gets
-0.70
comes
-0.69
ãĤ¦ãĤ¹
-0.66
Translation
-0.64
urations
-0.64
spo
-0.64
Means
-0.64
pron
-0.63
milo
-0.63
POSITIVE LOGITS
accuse
1.22
have
1.20
are
1.15
oppose
1.13
agree
1.12
contend
1.09
jointly
1.08
agreed
1.05
owe
1.05
pledged
1.04
Activations Density 0.213%