Explanations
occurrences of the verb "do" and its variations
Negative Logits
houſe         -1.05
itſelf        -1.00
Jefus         -0.98
themſelves    -0.93
ſche          -0.92
himſelf       -0.90
pleaſure      -0.88
ſever         -0.88
myſelf        -0.87
ſelf          -0.85
Positive Logits
do            1.23
done          1.22
Do            1.03
Do            1.00
does          0.94
DONE          0.93
DOING         0.89
Doing         0.89
doing         0.89
doing         0.88
Activations Density 0.051%
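
For readers unfamiliar with the density figure, here is a minimal sketch of how such a number could be computed. It assumes density means the fraction of sampled tokens on which the feature's activation is above zero; the page itself does not state the definition. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and chosen only to reproduce a value of the same size.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" is taken to be the share of tokens on which the
    feature fires at all. This definition is not confirmed by the source.
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical sample: 100,000 tokens, 51 of which activate the feature.
sample = [0.0] * 99_949 + [1.5] * 51
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.051%
```

Under this reading, a density of 0.051% would mean the feature fires on roughly 5 tokens in every 10,000, which fits a feature tied to a single verb family.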