INDEX
Explanations
numerical data and statistics
New Auto-Interp
Negative Logits
,
-0.63
-0.57
.
-0.57
+
-0.50
…
-0.49
...
-0.49
d
-0.47
↵↵
-0.47
(
-0.46
<eos>
-0.45
POSITIVE LOGITS
Monfieur
1.22
Efq
1.20
myſelf
1.15
Majefty
1.11
itſelf
1.09
Cæsar
1.06
Jefus
1.01
Theſe
1.00
raiſ
1.00
fubject
0.97
Activations Density 1.372%