INDEX
Explanations
vertical divider symbols and keywords like "array", and other function words near relevant symbols
code/programming
Negative Logits
<bos>         -1.85
Efq           -1.46
myſelf        -1.41
Monfieur      -1.39
itſelf        -1.38
Theſe         -1.33
Jefus         -1.23
Majefty       -1.22
himſelf       -1.21
themſelves    -1.21
Positive Logits
<eos>         0.77
_             0.68
[             0.68
–             0.68
B             0.65
(             0.65
fa            0.63
R             0.63
F             0.63
ma            0.62
Activations Density 2.745%