Negative Logits
yip          -0.65
xual         -0.65
kees         -0.64
ndum         -0.60
hepatitis    -0.59
ccording     -0.59
76561        -0.58
waters       -0.57
igated       -0.56
srf          -0.56
Positive Logits
enson        0.89
Cheney       0.86
erson        0.82
inson        0.81
ible         0.79
Grayson      0.76
ERSON        0.75
ie           0.73
buster       0.69
ass          0.69
Activations Density: 8.159%