INDEX
Explanations
proper nouns and names, particularly of individuals
New Auto-Interp
Negative Logits
.Atomic
-0.15
athi
-0.15
yet
-0.14
Bab
-0.14
Clarkson
-0.14
athy
-0.13
समर
-0.13
iffer
-0.13
Blake
-0.13
ugg
-0.13
POSITIVE LOGITS
arella
0.19
Bian
0.17
elan
0.17
械
0.15
avian
0.15
Vaults
0.15
afort
0.15
esian
0.15
antro
0.15
igan
0.15
Activations Density 0.021%