Explanations
mentions or references to the United Nations (UN)
references to the United Nations
Negative Logits
Token       Logit
Ages        -0.82
veland      -0.77
Camb        -0.74
thora       -0.71
é¾į         -0.70
Guardiola   -0.68
addock      -0.66
EStream     -0.66
afort       -0.65
Samurai     -0.65
Positive Logits
Token       Logit
SC          1.13
KNOWN       1.04
ilateral    0.96
ICE         0.91
envoy       0.90
GA          0.89
delegation  0.88
treaty      0.87
IF          0.86
ITED        0.86
Activations Density 0.012%