INDEX
    Explanations

    instances of the word "bit" within various contexts

    references to incremental or gradual changes

    Negative Logits
    Token        Logit
    "ammad"      -0.76
    "etheus"     -0.71
    " pend"      -0.67
    "theless"    -0.64
    "velt"       -0.64
    " Pend"      -0.64
    " Vaj"       -0.64
    "gaard"      -0.63
    "anguage"    -0.63
    " Duty"      -0.62
    Positive Logits
    Token        Logit
    "terness"    1.25
    "umen"       1.13
    "ches"       1.12
    "buck"       1.02
    "ching"      1.00
    "wig"        0.99
    "umin"       0.97
    "chery"      0.97
    "ters"       0.91
    "meal"       0.91
    Activation Density: 0.026%

    No Known Activations