INDEX
Explanations
discussions related to fairness and subjective opinions
New Auto-Interp
Negative Logits
Jefus
-1.02
متعلقه
-0.95
Paglinawan
-0.93
виправивши
-0.93
nahilalakip
-0.92
بوابة
-0.92
―――――
-0.91
photolibrary
-0.90
$_"
-0.89
auffi
-0.88
POSITIVE LOGITS
I
0.58
I
0.48
0.47
[
0.42
Whilst
0.41
ion
0.41
is
0.40
isn
0.38
esta
0.38
used
0.37
Activations Density 0.301%