INDEX
    Explanations

    mentions of drunkenness or drunk-related activities, especially focused on drunk driving

    references to intoxication or impairment due to alcohol

    New Auto-Interp
    Negative Logits
     Flavoring
    -0.98
    pta
    -0.82
    pha
    -0.76
     JPM
    -0.73
    isite
    -0.73
    TPP
    -0.70
    ocol
    -0.66
     Hosp
    -0.66
    CVE
    -0.66
    Downloadha
    -0.65
    POSITIVE LOGITS
    ards
    0.95
     manslaughter
    0.89
     drunk
    0.86
     driving
    0.84
    driving
    0.84
    ard
    0.83
     Driving
    0.79
     underage
    0.78
     arrest
    0.76
    bott
    0.74
    Act Density 0.064%

    No Known Activations