INDEX
Explanations
punctuation marks, particularly periods
web file extensions
New Auto-Interp
Negative Logits
rungsseite
-0.95
AndEndTag
-0.85
myſelf
-0.82
AnchorStyles
-0.81
IntoConstraints
-0.79
<unused1>
-0.75
<unused41>
-0.75
<unused28>
-0.74
<unused16>
-0.74
<unused3>
-0.74
POSITIVE LOGITS
.
0.37
+".
0.35
}.
0.35
+'.
0.35
.
0.33
。
0.32
$.
0.32
_.
0.32
.
0.32
}$.
0.31
Activations Density 0.018%