INDEX
Negative Logits
affe
-0.90
Geh
-0.89
Able
-0.87
hack
-0.85
Wellington
-0.83
erb
-0.82
Grey
-0.81
Barg
-0.81
unic
-0.81
Doodle
-0.81
POSITIVE LOGITS
sidx
0.94
WATCHED
0.94
evidenced
0.91
intended
0.88
actionDate
0.87
liking
0.87
ãĤ¡
0.86
maxwell
0.86
ription
0.84
indicated
0.83
Activations Density 1.067%