INDEX
    Explanations

    the word "little" in various contexts

    New Auto-Interp
    Negative Logits
    uality
    -0.17
    landa
    -0.17
    oe
    -0.17
    /doc
    -0.15
    i
    -0.15
    ron
    -0.14
    ing
    -0.14
    eded
    -0.14
    apesh
    -0.14
    ÙĪÙ¾
    -0.14
    POSITIVE LOGITS
     bit
    0.38
    -known
    0.34
    -bit
    0.31
    bit
    0.29
    Bits
    0.26
    .bit
    0.25
    /big
    0.24
    _bit
    0.23
    /tiny
    0.22
     Bit
    0.22
    Act Density 0.050%

    No Known Activations