INDEX
Explanations
proper names of individuals
proper nouns, particularly names of individuals or organizations
New Auto-Interp
Negative Logits
ivity
-0.74
verted
-0.71
ä¹ĭ
-0.69
istics
-0.69
ential
-0.65
yer
-0.63
encia
-0.61
ordinate
-0.61
acters
-0.59
rend
-0.59
POSITIVE LOGITS
Seym
0.80
insk
0.78
nih
0.78
useum
0.75
ambo
0.75
emonic
0.74
unicip
0.73
hattan
0.72
asonic
0.72
ullah
0.71
Activations Density 0.161%