INDEX
Explanations
racial or group identifiers used in a negative or segregating way.
New Auto-Interp
Negative Logits
tartalomajánló
-0.65
Dữ
-0.65
Sucesor
-0.64
gynhyrchwyd
-0.64
<bos>
-0.63
########.
-0.62
__":
-0.59
adpleegd
-0.58
}]
-0.57
كومونز
-0.57
POSITIVE LOGITS
StoryboardSegue
0.56
kimse
0.52
autrefois
0.47
memoized
0.47
MLLoader
0.47
UnitTesting
0.46
.
0.44
ArgumentParser
0.44
SuccessListener
0.43
vertx
0.43
Activations Density 2.831%