INDEX
    Explanations

    the occurrence of the word "been" in various contexts

    New Auto-Interp
    Negative Logits
     Being
    -0.29
    being
    -0.29
     being
    -0.28
    still
    -0.27
    Being
    -0.27
    -being
    -0.25
     STILL
    -0.23
     still
    -0.23
    被
    -0.22
     Still
    -0.20
    POSITIVE LOGITS
    /is
    0.27
     lately
    0.26
     recently
    0.24
     through
    0.23
     around
    0.23
     previously
    0.21
     since
    0.21
    Recently
    0.19
     able
    0.19
     Recently
    0.19
    Act Density 0.130%

    No Known Activations