INDEX
Explanations
expressions of agreement or strong opinions
following pronouns or question words
program name and description
New Auto-Interp
Negative Logits
AssemblyCulture
-0.81
AndEndTag
-0.78
SourceChecksum
-0.78
</caption>
-0.76
الرياضيه
-0.74
!")
-0.72
momix
-0.70
SOUNDBITE
-0.68
^(@)
-0.68
ddelweddau
-0.67
POSITIVE LOGITS
What
0.77
Honestly
0.76
Honestly
0.75
Why
0.73
What
0.72
Wasn
0.70
I
0.70
I
0.68
Looks
0.68
Maybe
0.67
Activations Density 0.155%