INDEX
    Explanations

    the occurrence of the word "small" in various contexts

    New Auto-Interp
    Negative Logits
    iras
    -0.16
    ifu
    -0.15
    bsolute
    -0.14
    ookies
    -0.14
    ookie
    -0.14
    jar
    -0.14
    ushi
    -0.13
     sola
    -0.13
     Fleming
    -0.13
    285
    -0.13
    POSITIVE LOGITS
    /small
    0.23
    /tiny
    0.23
    -scale
    0.22
    -sized
    0.20
    scale
    0.19
    -small
    0.19
    /big
    0.18
     Sized
    0.17
    wares
    0.17
    cot
    0.17
    Act Density 0.038%

    No Known Activations