INDEX
Explanations
the presence of specific structural or formatting elements, such as sections and lists, in the document
Punctuation after questions or quotes
conditional statements and questions
New Auto-Interp
Negative Logits
enumii
-0.57
nė
-0.55
Simplemente
-0.54
simply
-0.52
Ironically
-0.52
Quite
-0.51
дописавши
-0.51
initially
-0.51
simply
-0.49
Incluso
-0.48
POSITIVE LOGITS
XYZ
1.16
blah
1.14
〇〇
0.99
X
0.99
!”.
0.98
xyz
0.97
!”,
0.93
○○
0.91
____
0.89
XYZ
0.86
Activations Density 0.241%