INDEX
Explanations
punctuated sentence endings
New Auto-Interp
Negative Logits
millan
-0.66
"
-0.64
"
-0.63
'
-0.61
(
-0.60
Tind
-0.60
-0.59
"]
-0.59
I
-0.59
Mann
-0.59
POSITIVE LOGITS
poffible
1.19
avoient
1.15
Theſe
1.13
.&
1.11
Monfieur
1.11
uſed
1.11
myſelf
1.10
Jefus
1.09
ſeveral
1.07
étoient
1.05
Activations Density 0.383%