INDEX
Explanations
proper nouns related to specific places or people
instances of the letter 'o' in various contexts
Negative Logits
glim      -0.92
condem    -0.78
confir    -0.70
t         -0.69
cryst     -0.67
rified    -0.66
mble      -0.65
s         -0.65
Seym      -0.65
è£        -0.64
Positive Logits
zzi        1.44
cean       1.18
zzo        1.15
ctor       1.05
cephal     1.04
oms        1.03
onga       1.00
ebus       0.97
vernment   0.94
zza        0.94
Activations Density 0.077%
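
The density figure above can be read as the share of token positions on which the feature fires. The sketch below illustrates that reading only; it assumes "fires" means an activation strictly above a threshold of zero, and the function name `activation_density` and the sample data are hypothetical, not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: a feature "fires" on a token when its activation is
    strictly greater than threshold (0.0 by default).
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical data: 10,000 token positions, 8 of which activate the feature.
sample = [0.0] * 9992 + [0.5] * 8
density = activation_density(sample)
print(f"Activations Density {density:.3%}")
```

Under this reading, a density of 0.077% would mean the feature is active on roughly 77 of every 100,000 tokens.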