INDEX
Explanations
variations of the word "some."
New Auto-Interp
Negative Logits
oppers
-0.17
eous
-0.16
ivot
-0.15
overy
-0.15
eos
-0.15
ymm
-0.15
yonel
-0.14
yms
-0.14
ÑģÑĤа
-0.14
sik
-0.14
POSITIVE LOGITS
ewhere
0.32
brero
0.30
ewhat
0.29
erville
0.27
erset
0.26
etime
0.26
mers
0.25
thing
0.23
ETIME
0.22
ber
0.21
Activations Density 0.008%