Explanations
mentions of the location "Pyongyang"
references to Pyongyang or North Korea
Negative Logits
Token               Logit
++++++++++++++++    -0.77
###                 -0.74
iencies             -0.72
ently               -0.71
Compan              -0.69
RAW                 -0.69
teness              -0.68
Uncommon            -0.68
igor                -0.67
vol                 -0.67
Positive Logits
Token               Logit
ongyang             1.27
Pyongyang           1.06
ascus               0.88
DPRK                0.82
ijing               0.81
Seoul               0.78
abad                0.74
Lumpur              0.71
orea                0.70
sung                0.70
Activations Density 0.014%