INDEX
Negative Logits
yip
-0.91
Lur
-0.63
bluff
-0.63
EStream
-0.62
EStreamFrame
-0.62
Carson
-0.61
street
-0.60
specificity
-0.60
Ô
-0.59
tert
-0.59
POSITIVE LOGITS
ables
0.92
ense
0.90
ations
0.86
nces
0.85
inarily
0.84
rencies
0.84
aun
0.82
ous
0.80
ername
0.79
ences
0.79
Activations Density 0.108%