INDEX
Negative Logits
recur
-0.07
Cart
-0.07
divider
-0.07
_depart
-0.07
Ho
-0.07
mushrooms
-0.06
yönet
-0.06
Detective
-0.06
incurred
-0.06
ูช
-0.06
POSITIVE LOGITS
SSL
0.12
ssl
0.10
SSL
0.09
ssl
0.09
SL
0.08
znam
0.07
the
0.07
TL
0.07
lez
0.06
compulsory
0.06
Activations Density 0.004%