INDEX
Explanations
HTML block elements and their attributes
New Auto-Interp
Negative Logits
<eos>
-0.99
(
-0.82
?
-0.80
.
-0.80
\
-0.79
B
-0.76
;
-0.76
1
-0.75
\
-0.74
U
-0.74
POSITIVE LOGITS
itſelf
1.63
Efq
1.51
myſelf
1.50
ſelves
1.34
ſeveral
1.34
faſt
1.31
themſelves
1.25
pleaſure
1.22
purpoſe
1.22
doubtnut
1.20
Activations Density 0.066%