INDEX
Explanations
mathematical symbols and formatting elements
New Auto-Interp
Negative Logits
Efq
-1.59
Monfieur
-1.54
myſelf
-1.53
Theſe
-1.48
itſelf
-1.38
iſt
-1.36
Jefus
-1.35
―――――
-1.33
ſeveral
-1.33
Houſe
-1.33
POSITIVE LOGITS
\
1.08
\
0.74
$\
0.74
{\0.73
(
0.73
<eos>
0.71
</tr>
0.70
...
0.69
<tr>
0.68
.\
0.68
Activations Density 0.100%