INDEX
Explanations
punctuation marks, particularly periods and quotation marks
punctuation marks
speaking, commenting, saying
Negative Logits
RegressionTest    -0.57
BoxShadow         -0.51
letra             -0.50
חיצוניים          -0.50
fotografía        -0.48
estoppel          -0.48
ográficos         -0.47
inconn            -0.47
endpush           -0.46
ınır              -0.46
Positive Logits
Commenting        1.35
Speaking          1.16
commented         1.14
Speaking          1.09
commenting        1.08
speaking          1.07
Comment           1.03
said              0.97
said              0.93
commented         0.88
Activations Density: 0.024%