INDEX
Explanations
locations specified in a specific format like city and state abbreviations or city and country
location identifiers, specifically state abbreviations in the United States
New Auto-Interp
Negative Logits
ufact
-0.84
lihood
-0.82
renheit
-0.75
andowski
-0.71
toggle
-0.71
holding
-0.69
months
-0.68
é¾įå
-0.68
paragraph
-0.67
ror
-0.67
POSITIVE LOGITS
Dept
0.78
./
0.72
ARA
0.70
FF
0.67
ADA
0.65
Supervisor
0.64
JR
0.64
FC
0.63
IFF
0.62
BB
0.62
Activations Density 0.037%