INDEX
    Explanations

    names of people and places

    references to a specific brand or product related to beer

    New Auto-Interp
    Negative Logits
    ALLY
    -0.73
    LESS
    -0.68
     GOODMAN
    -0.68
    acion
    -0.67
    ulators
    -0.66
    ORE
    -0.66
    eering
    -0.64
    amide
    -0.63
    ARC
    -0.60
    CAR
    -0.60
    POSITIVE LOGITS
    ught
    1.17
    fters
    1.15
    cffff
    1.04
    enei
    0.97
    dra
    0.94
    plets
    0.92
    ven
    0.91
    isine
    0.87
    uth
    0.85
    fter
    0.84
    Act Density 0.012%

    No Known Activations