INDEX
    Explanations

    the word "catch" in various contexts

    Negative Logits
    Token         Logit
    "bard"        -0.63
    " princip"    -0.61
    "sburgh"      -0.58
    " annex"      -0.58
    "die"         -0.58
    "htar"        -0.58
    "inburgh"     -0.57
    " coerc"      -0.57
    "enstein"     -0.57
    "burgh"       -0.57
    Positive Logits
    Token         Logit
    "phrase"      1.08
    "netflix"     0.77
    " glimps"     0.73
    "tails"       0.72
    "ipes"        0.72
    "weight"      0.71
    "tail"        0.68
    " Luffy"      0.68
    "weights"     0.68
    "amac"        0.68
    Act Density 0.022%

    No Known Activations