INDEX
Explanations
titles of people in charge
Negative Logits
Token       Logit
Efq         -2.27
ſelf        -2.14
Monfieur    -2.14
myſelf      -2.14
ſelves      -2.09
Theſe       -2.05
Majefty     -2.05
Jefus       -1.98
itſelf      -1.93
auffi       -1.88
Positive Logits
Token       Logit
↵↵          1.52
(blank)     1.42
↵           1.21
<eos>       1.20
(blank)     1.18
1           1.16
2           1.14
(           1.12
3           1.07
I           1.06
Activations Density 1.267%