Explanations
names with a specific pattern, likely looking for a specific individual's name
proper nouns related to specific individuals or organizations
Negative Logits
namese      -0.87
ertation    -0.77
ingly       -0.76
æĦ          -0.75
spirited    -0.73
thous       -0.71
ains        -0.70
ained       -0.70
abilia      -0.65
ĭ           -0.64
Positive Logits
Zig         1.09
zag         1.04
phrine      0.95
Zah         0.88
vez         0.88
agame       0.87
iggurat     0.83
Zin         0.81
ebra        0.80
ZZ          0.80
Activations Density 0.017%
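
The page does not define the density figure. One plausible reading, offered here only as an assumption, is that it is the fraction of sampled token positions on which the feature has a nonzero activation. Under that reading, 0.017% corresponds to about 17 firing positions per 100,000 tokens. The sketch below illustrates the arithmetic. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and not taken from the page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of sampled tokens on which the
    feature fires at all. The source page does not confirm this definition.
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical sample: 100,000 token positions, 17 of which activate.
sample = [0.0] * 99_983 + [1.5] * 17
density = activation_density(sample)
print(f"{density:.3%}")  # prints "0.017%"
```

A feature this sparse fires on very few tokens, which would fit the narrow token pattern in the positive logits if the assumed definition holds.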