INDEX
Explanations
the word "even" in various contexts
Negative Logits
myſelf          -1.14
Majefty         -1.13
Jefus           -1.06
Efq             -1.04
ſelf            -1.02
PerformLayout   -1.02
pleaſure        -1.00
ſeveral         -1.00
faſt            -0.97
raiſ            -0.94
Positive Logits
no              0.95
never           0.84
cannot          0.78
No              0.76
nobody          0.74
NO              0.71
nothing         0.70
none            0.69
no              0.68
never           0.68
Activations Density 0.282%