INDEX
Explanations
phrases indicating significant issues or notable characteristics
New Auto-Interp
Negative Logits
\{\\-0.85
AutoScaleMode
-0.76
يميديا
-0.69
expandindo
-0.69
piele
-0.68
Administrativna
-0.67
Shakspeare
-0.65
Shaksp
-0.63
Roskov
-0.62
downvotes
-0.61
POSITIVE LOGITS
question
0.58
那就是
0.56
:
0.54
namely
0.53
namely
0.52
——
0.51
called
0.51
—
0.51
I
0.51
withIdentifier
0.49
Activations Density 0.521%