Explanations
negations or denials in the text
Negative Logits
Token         Logit
myſelf        -0.97
Cæsar         -0.90
Majefty       -0.89
itſelf        -0.86
Monfieur      -0.80
aquilo        -0.80
ynchronously  -0.77
himſelf       -0.77
avoient       -0.76
auxquelles    -0.75
Positive Logits
Token         Logit
no            1.22
No            1.10
NO            0.87
No            0.83
a             0.79
real          0.77
igno          0.77
ไม่มี           0.73
no            0.72
additional    0.72
Activations Density: 0.198%