Explanations
occurrences of the number five, either as a numeral or written out as a word
Negative Logits
ards     -0.18
cano     -0.17
ogle     -0.16
lest     -0.16
iness    -0.16
(es      -0.16
shots    -0.15
elta     -0.15
essel    -0.14
188      -0.14
Positive Logits
-HT      0.30
-star    0.25
де       0.22
pais     0.22
-Star    0.21
borough  0.20
Borough  0.20
zig      0.20
ive      0.19
senses   0.19
Activations Density: 0.122%