INDEX
Explanations
conditional phrases and statements that express uncertainty or hypotheticals
Negative Logits
Token      Logit
ſelves     -1.00
pleaſure   -1.00
ſelf       -0.99
itſelf     -0.99
―――――      -0.97
Majefty    -0.97
myſelf     -0.95
Efq        -0.90
་་         -0.89
Anſ        -0.89
Positive Logits
Token      Logit
they       1.62
we         1.39
someone    1.38
somebody   1.32
you        1.28
someone    1.21
somebody   1.13
SOMEONE    1.04
people     1.03
nobody     1.00
Activations Density 0.505%
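
For readers unfamiliar with the density figure, the sketch below shows one common way such a number is obtained: the fraction of token positions on which the feature's activation is above zero. This is an assumption about how the statistic is defined, not something the page states; the function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens on which the feature fires.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 1000 token positions, 5 of which activate the feature.
example = [0.0] * 995 + [0.7, 1.2, 0.3, 2.1, 0.9]
density = activation_density(example)
print(f"{density:.3%}")  # prints 0.500%
```

Under this reading, a density of 0.505% would mean the feature fires on roughly 1 in 200 tokens of the sampled text.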