INDEX
Explanations
occurrences of the letter 'z' in various contexts
New Auto-Interp
Negative Logits
utherford
-0.17
askell
-0.15
ossal
-0.15
mts
-0.15
cheon
-0.14
kek
-0.14
ordova
-0.13
reserve
-0.13
Gupta
-0.13
akedown
-0.13
POSITIVE LOGITS
ivil
0.27
entral
0.26
eh
0.25
wing
0.24
itat
0.24
uge
0.23
ulet
0.23
umin
0.22
iele
0.22
ahlen
0.22
Activations Density 0.006%