    Explanations

    mentions of alcohol consumption and its implications

    Negative Logits
     feeder       -0.16
    olini         -0.16
     Breakfast    -0.15
     hungry       -0.15
     Candy        -0.15
     bake         -0.15
     Bath         -0.15
     Bake         -0.14
     Soap         -0.14
    Chocolate     -0.14
    Positive Logits
     alcohol      0.49
    酒             0.46
     alcoholic    0.43
     Alcohol      0.42
     drink        0.41
     алког        0.39
     rượu         0.38
     booze        0.38
     beer         0.37
    cohol         0.37
    Act Density 0.512%

    No Known Activations