INDEX
Explanations
mentions of the country South Korea
references to South Korea
New Auto-Interp
Negative Logits
lli
-0.76
llo
-0.71
apple
-0.67
Mayweather
-0.65
ments
-0.62
adeon
-0.62
Gathering
-0.62
alg
-0.61
igue
-0.60
é¾įåĸļ士
-0.60
POSITIVE LOGITS
orea
1.12
Korea
0.89
ese
0.82
Koreans
0.81
peninsula
0.80
DPRK
0.77
DPR
0.77
Peninsula
0.76
sung
0.74
atan
0.73
Activations Density 0.016%