INDEX
Explanations
proper nouns, particularly names of individuals and organizations
Negative Logits
ÄĻ              -0.17
enz             -0.16
çī              -0.15
ombok           -0.15
imin            -0.14
éħ¸             -0.14
burgh           -0.14
expo            -0.14
lesi            -0.14
اÙĦÙħص          -0.14
Positive Logits
SEQU            0.15
lek             0.14
eer             0.14
istrovstvÃŃ     0.14
fund            0.14
MacDonald       0.14
Ĥ¤              0.14
inions          0.14
uib             0.14
czy             0.13
Activations Density 0.043%