INDEX
Explanations
punctuation marks and variation in sentence endings
Follows periods at the end of sentences
German, Danish, Russian, Slavic, English words followed by specific punctuation or function words
New Auto-Interp
Negative Logits
aka
-0.83
incentiv
-0.80
showcasing
-0.77
aka
-0.77
leveraging
-0.71
AKA
-0.71
AKA
-0.69
Additionally
-0.68
للاسماء
-0.68
FYI
-0.66
POSITIVE LOGITS
Daß
0.81
doubtless
0.69
forthwith
0.60
UserScript
0.59
scarcely
0.59
ибо
0.59
läßt
0.58
Надо
0.57
daß
0.54
muß
0.53
Activations Density 0.593%