INDEX
Negative Logits
Dating
-0.68
hene
-0.67
ophobia
-0.66
Merit
-0.62
Discrimination
-0.61
:(
-0.61
quit
-0.60
THING
-0.60
REAL
-0.60
anymore
-0.60
POSITIVE LOGITS
assorted
0.87
accompan
0.82
nearby
0.81
respectively
0.79
complement
0.78
cellaneous
0.78
optionally
0.76
accompany
0.75
lieu
0.75
standby
0.75
Activations Density 0.419%