INDEX
    Explanations

    mentions of beverages, particularly alcoholic drinks

    New Auto-Interp
    Negative Logits
     Evaluation
    -0.50
     Evaluations
    -0.50
    raft
    -0.48
    UNTAIN
    -0.48
     Thin
    -0.48
    Thin
    -0.47
    Evaluation
    -0.47
    EVAL
    -0.46
     Profil
    -0.46
     evaluation
    -0.45
    POSITIVE LOGITS
    AndEndTag
    0.77
    Champagne
    0.50
     Champagne
    0.49
     champagne
    0.46
    yarnpkg
    0.43
     autorytatywna
    0.42
    ंदीखरीदारी
    0.42
    Personensuche
    0.42
    champagne
    0.42
    windowFixed
    0.40
    Act Density 0.288%

    No Known Activations