INDEX
Explanations
proper nouns and names related to individuals and organizations in various contexts
New Auto-Interp
Negative Logits
yb
-0.17
ossier
-0.17
aleigh
-0.16
å±ĭ
-0.16
andest
-0.16
kü
-0.16
-transparent
-0.16
ÃŃses
-0.15
aspers
-0.15
éric
-0.15
POSITIVE LOGITS
owski
0.41
inski
0.32
ski
0.32
ows
0.30
sky
0.28
iew
0.27
kowski
0.25
cz
0.25
czy
0.25
ka
0.23
Activations Density 0.090%