INDEX
Explanations
occurrences of the letter 'Z' in various contexts
New Auto-Interp
Negative Logits
idal
-0.17
asion
-0.17
PEAT
-0.16
allon
-0.15
ENTIAL
-0.15
ifen
-0.15
itus
-0.15
owns
-0.15
orage
-0.14
allet
-0.14
POSITIVE LOGITS
ipline
0.18
weis
0.17
ACH
0.17
iad
0.17
cock
0.17
aned
0.16
aire
0.16
dech
0.16
ACK
0.16
sup
0.15
Activations Density 0.016%