INDEX
Explanations
locations mentioned in news articles
mentions of the city of Pyongyang
New Auto-Interp
Negative Logits
++++++++++++++++
-0.93
lv
-0.76
drivers
-0.75
###
-0.75
gio
-0.75
Uncommon
-0.74
title
-0.73
Torrent
-0.72
igor
-0.72
ivan
-0.72
POSITIVE LOGITS
ongyang
1.31
Pyongyang
1.22
DPRK
0.97
Seoul
0.97
Korea
0.94
Koreans
0.89
Jong
0.85
ì
0.82
ascus
0.82
orea
0.81
Activations Density 0.010%