Explanations
proper nouns, particularly names of individuals
Follows initialisms or names
Korean given names
Negative Logits
Korea    -0.96
Korean   -0.96
Koreans  -0.91
Korean   -0.87
Chinese  -0.85
China    -0.80
korea    -0.80
Chinese  -0.79
China    -0.79
CHINESE  -0.78
Positive Logits
Park           0.61
Woo            0.51
Young          0.49
Boo            0.49
woo            0.47
Woo            0.47
young          0.47
Park           0.46
UrlResolution  0.46
onOptions      0.45
Activations Density 0.111%