Explanations
instances related to driving, vehicles, and car accidents
mentions of drivers in various contexts
Negative Logits
Token        Logit
iversal      -0.80
aeper        -0.79
yss          -0.78
achu         -0.77
Flavoring    -0.76
ertain       -0.73
ropolitan    -0.72
reme         -0.72
pta          -0.72
orkshire     -0.71
Positive Logits
Token        Logit
drivers      0.94
driver       0.93
pige         0.90
driving      0.84
driver       0.83
less         0.83
bees         0.80
driving      0.78
Drivers      0.78
wings        0.77
Activations Density: 0.016%