INDEX
Negative Logits
betweenstory
-0.91
للاسماء
-0.90
Roskov
-0.86
tagHelperRunner
-0.85
ConstraintMaker
-0.84
tartalomajánló
-0.83
للمعارف
-0.82
referenties
-0.81
تضيفلها
-0.78
évaluateur
-0.78
POSITIVE LOGITS
all
0.52
.
0.52
U
0.41
for
0.39
realizadas
0.39
!
0.38
mismas
0.38
for
0.37
U
0.35
,
0.35
Activations Density 0.009%