Explanations
mentions of roads or road-related incidents and safety measures
Negative Logits
arians       -0.85
ividual      -0.82
illian       -0.76
irements     -0.68
Hots         -0.67
tle          -0.67
rator        -0.65
emort        -0.65
uates        -0.64
ropolitan    -0.64
Positive Logits
ways         1.27
blocks       1.21
trip         1.18
block        1.04
side         1.01
show         0.98
map          0.94
hog          0.91
fare         0.89
runner       0.88
Activations Density: 0.026%