INDEX
Explanations
words ending in 'o'
instances of a specific letter or vowel sound
New Auto-Interp
Negative Logits
glim
-0.94
Seym
-0.73
Shades
-0.72
iard
-0.69
condem
-0.68
ARGET
-0.67
è£
-0.67
channelAvailability
-0.66
rals
-0.65
ãĥķãĤ©
-0.65
POSITIVE LOGITS
zzi
1.26
cean
1.23
zzo
1.18
ghan
1.09
pport
1.02
vernment
1.01
zz
0.93
ctor
0.92
oms
0.92
ceans
0.92
Activations Density 0.041%