INDEX
Explanations
references to dangerous driving behavior
references to ongoing situations or conditions
Negative Logits
ansson      -0.79
ovies       -0.68
aeus        -0.67
itsch       -0.66
Zucker      -0.65
essional    -0.64
Io          -0.64
Roberts     -0.64
EVA         -0.64
INST        -0.63
Positive Logits
versions    0.69
punk        0.66
cies        0.65
tar         0.63
breaching   0.62
tha         0.60
grading     0.60
finding     0.58
resa        0.58
rave        0.57
Activations Density 0.000%