INDEX
Explanations
occurrences of the word "do" and its variations
Negative Logits
houſe             -1.01
ſche              -0.98
itſelf            -0.86
stiefel           -0.86
ſever             -0.86
Efq               -0.85
Jefus             -0.85
<<<<<<<<<<<<<<    -0.85
themſelves        -0.84
pleaſure          -0.84
Positive Logits
do                1.47
done              1.25
does              1.24
Do                1.22
doing             1.19
Do                1.17
Doing             1.15
did               1.13
Doing             1.11
DOING             1.10
Activations Density 0.157%