INDEX
    Explanations

    the word "each" in various contexts

    Negative Logits
    Token         Logit
    "er"          -0.64
    " SUT"        -0.62
    "im"          -0.62
    " Goy"        -0.61
    " pinos"      -0.60
    "shit"        -0.57
    " Bon"        -0.57
    " fores"      -0.56
    " Wiseman"    -0.56
    "ter"         -0.56
    Positive Logits
    Token         Logit
    "EACH"        2.30
    " each"       2.23
    " EACH"       2.18
    " Each"       2.12
    "each"        2.11
    "Each"        2.08
    "Chaque"      1.84
    " Chaque"     1.74
    " chaque"     1.66
    " ciasc"      1.59
    Activation Density: 0.101%

    No Known Activations