INDEX
Explanations
various types of underscores and hyphens in the text
URLs and emails
New Auto-Interp
Negative Logits
(
-0.41
aplicable
-0.38
want
-0.38
cár
-0.36
-
-0.36
receive
-0.35
récents
-0.34
comprob
-0.33
loopt
-0.33
decenas
-0.33
POSITIVE LOGITS
ſehen
0.90
0.81
niſſe
0.79
betweenstory
0.79
ſicht
0.79
+#+
0.78
propOrder
0.78
⟬
0.76
#+#
0.75
ſſung
0.75
Activations Density 0.023%