INDEX
    Explanations

    mentions of the name "Gerry" at varying activations

    references to the word "berry" and its variations

    New Auto-Interp
    Negative Logits
    agos
    -0.90
    isting
    -0.76
    ahime
    -0.75
    aepernick
    -0.75
    iesta
    -0.75
    agonist
    -0.75
    icago
    -0.75
    awar
    -0.74
    ouch
    -0.74
    anooga
    -0.73
    POSITIVE LOGITS
    mand
    1.05
    mite
    0.79
    erry
    0.78
    ments
    0.77
     Gerry
    0.76
    mph
    0.74
    bye
    0.73
    nda
    0.72
    llo
    0.71
    ng
    0.70
    Act Density 0.018%

    No Known Activations