INDEX
    Explanations

    phrases related to drinking

    references to drinking alcohol

    New Auto-Interp
    Negative Logits
    CLASSIFIED
    -0.78
    rious
    -0.69
    aeda
    -0.64
     lat
    -0.64
     accompl
    -0.63
    Requ
    -0.63
     gre
    -0.63
    RIC
    -0.61
     suites
    -0.60
     Lans
    -0.60
    POSITIVE LOGITS
    emouth
    0.84
    ewater
    0.74
    umen
    0.71
    animal
    0.70
    çͰ
    0.70
    ecast
    0.69
    )(
    0.66
    emaker
    0.65
    bernatorial
    0.64
     Devi
    0.63
    Act Density 0.000%

    No Known Activations