INDEX
Explanations
phrases related to Korean topics
references to Korea and its related entities
New Auto-Interp
Negative Logits
yard
-0.79
MENTS
-0.79
MENT
-0.79
Mayweather
-0.78
ments
-0.77
igue
-0.77
llo
-0.76
clair
-0.72
ashtra
-0.70
lli
-0.70
POSITIVE LOGITS
orea
1.06
peninsula
1.05
Korea
0.99
Peninsula
0.91
ë
0.90
ì
0.89
Koreans
0.85
Sung
0.84
Korean
0.82
ongyang
0.81
Activations Density 0.023%