INDEX
Explanations
mentions of the city of Seoul, South Korea
mentions of the city of Seoul
Negative Logits
role          -0.81
alg           -0.71
iddler        -0.68
rd            -0.68
asso          -0.66
Mayweather    -0.66
Arthur        -0.66
vor           -0.64
RH            -0.64
apple         -0.64
Positive Logits
Seoul         1.38
Korea         1.08
Lumpur        1.05
ãħĭãħĭ        0.94
ë             0.94
wark          0.91
Chung         0.91
ãħĭ           0.90
Pyongyang     0.87
Koreans       0.86
Activations Density: 0.013%