Explanations
expressions of injustice and unfairness
Negative Logits
(no visible token)  -0.83
standart            -0.57
I                   -0.57
(no visible token)  -0.55
fucking             -0.54
this                -0.54
specifik            -0.52
you                 -0.52
%                   -0.51
+                   -0.50
Positive Logits
مشين          1.00
tvguidetime   0.91
freilich      0.87
};*/          0.86
sidemargin    0.85
*/;           0.85
)))));        0.84
*/;           0.82
GraphicsUnit  0.82
estekak       0.82
Activations Density 0.201%