INDEX
Explanations
punctuation marks, especially commas
Negative Logits
Token        Logit
}}$}         -1.02
ſelf         -0.95
Efq          -0.95
myſelf       -0.94
Majefty      -0.90
^(@)         -0.90
cherchés     -0.89
ſelves       -0.86
itſelf       -0.85
―――――        -0.85
Positive Logits
Token        Logit
I            0.89
it           0.80
you          0.76
<eos>        0.72
we           0.72
If           0.70
Do           0.69
He           0.68
[token not rendered]  0.67
The          0.66
Activations Density: 0.155%