INDEX
Negative Logits
ValueStyle
-0.61
[*]
-0.51
adecimal
-0.50
COUVER
-0.49
ghed
-0.48
enumi
-0.46
لع
-0.46
Belast
-0.46
Clik
-0.45
atile
-0.43
POSITIVE LOGITS
evos
0.64
олові
0.62
advisor
0.58
intios
0.57
للاسماء
0.57
ConstraintMaker
0.57
UnusedPrivate
0.56
adviser
0.56
Personensuche
0.56
Juana
0.56
Activations Density 0.001%