INDEX
Explanations
ranges or numerical values
numerical ranges or percentages
New Auto-Interp
Negative Logits
authorized
-0.61
estate
-0.60
wordpress
-0.60
erected
-0.59
ilyn
-0.59
pain
-0.58
Polk
-0.57
PUBLIC
-0.56
conserv
-0.56
Signs
-0.54
POSITIVE LOGITS
ousand
0.79
mph
0.73
PF
0.72
ãĤ´ãĥ³
0.72
20439
0.71
HK
0.70
tenance
0.70
sidx
0.69
AM
0.69
GHz
0.69
Activations Density 0.089%