INDEX
Explanations
phrases that convey a sense of obligation or necessity
New Auto-Interp
Negative Logits
i
-0.54
I
-0.54
(
-0.53
a
-0.51
mo
-0.50
E
-0.49
↵
-0.47
-0.46
great
-0.46
can
-0.45
POSITIVE LOGITS
ſelf
1.20
nakalista
1.19
Majefty
1.13
itſelf
1.13
ſelves
1.05
myſelf
1.05
Efq
1.04
leſs
1.03
་་
1.01
―――――
1.00
Activations Density 0.051%