INDEX
Explanations
instances of the word "by" and its variations in the text
New Auto-Interp
Negative Logits
sto
-0.18
pin
-0.17
urs
-0.16
store
-0.15
ping
-0.15
sw
-0.15
stor
-0.15
pedia
-0.15
point
-0.14
stm
-0.14
POSITIVE LOGITS
gone
0.24
-election
0.23
rne
0.22
antine
0.22
gger
0.20
esian
0.19
products
0.19
virtue
0.19
-products
0.18
laws
0.18
Activations Density 0.112%