INDEX
Explanations
mentions of drunkenness or drunk-related activities, especially focused on drunk driving
references to intoxication or impairment due to alcohol
Negative Logits
Flavoring     -0.98
pta           -0.82
pha           -0.76
JPM           -0.73
isite         -0.73
TPP           -0.70
ocol          -0.66
Hosp          -0.66
CVE           -0.66
Downloadha    -0.65
Positive Logits
ards            0.95
manslaughter    0.89
drunk           0.86
driving         0.84
driving         0.84
ard             0.83
Driving         0.79
underage        0.78
arrest          0.76
bott            0.74
Activations Density 0.064%