INDEX
Negative Logits
ViewFeatures
-0.90
Geplaatst
-0.90
تقاوى
-0.88
Breeders
-0.88
amaño
-0.87
ValueStyle
-0.86
reaſon
-0.85
IntoConstraints
-0.84
ſmall
-0.83
myſelf
-0.82
POSITIVE LOGITS
y
0.74
a
0.64
s
0.63
ship
0.59
ی
0.55
i
0.54
with
0.47
for
0.45
in
0.43
is
0.43
Activations Density 0.068%