INDEX
Explanations
dates or time-related information, like months and years
negative scores or results associated with sports events
Negative Logits
Yelp        -0.91
Boone       -0.89
Lovecraft   -0.89
Whitman     -0.86
Indiana     -0.84
Burr        -0.84
Idaho       -0.83
Dickinson   -0.83
Madison     -0.83
ACLU        -0.83
Positive Logits
organ       0.97
placed      0.97
colour      0.95
organise    0.91
backed      0.91
shaped      0.90
Af          0.90
Pakistan    0.90
petrol      0.89
kil         0.88
Activations Density: 0.198%