INDEX
Explanations
mentions of the United Nations (UN)
references to the United Nations (UN)
Negative Logits
\(          -0.69
subp        -0.63
Guardiola   -0.62
ynski       -0.60
Klopp       -0.59
prone       -0.57
ographed    -0.57
swinging    -0.57
quizz       -0.57
zzle        -0.57
Positive Logits
KNOWN       1.32
ICE         1.30
LV          1.25
SC          1.22
ESCO        1.19
IVERS       1.17
ITED        1.14
IX          1.13
GA          1.13
DP          1.13
Activations Density 0.020%