NEGATIVE LOGITS
Token      Logit
mileage    -0.65
ãĥł        -0.63
Caldwell   -0.61
Rosenthal  -0.59
Abs        -0.58
corrid     -0.58
¯¯¯¯       -0.57
Curiosity  -0.57
¯¯¯¯¯¯¯¯   -0.57
OB         -0.56
POSITIVE LOGITS
Token      Logit
dates      1.53
olicy      1.24
dating     1.09
edia       1.07
stairs     1.07
olitan     1.07
odcast     1.07
rint       1.06
reme       1.06
resents    1.05
Activations Density: 0.060%