INDEX
Explanations
names of individuals
proper nouns, particularly names and geographical locations
Negative Logits
ues        -0.75
ulatory    -0.73
sight      -0.71
ãĤ¡        -0.70
encers     -0.69
ski        -0.69
furt       -0.68
cod        -0.68
ences      -0.67
ICE        -0.67
Positive Logits
ysc        0.77
ascus      0.75
è£ħ        0.72
Kod        0.70
mercial    0.68
merce      0.67
chuk       0.67
ickr       0.65
gow        0.64
ħĭ         0.63
Activations Density 0.056%