INDEX
    Explanations

    instances of the word "each" and related terms indicating distribution or individual items in a set

    New Auto-Interp
    Negative Logits
    amen
    -0.17
    igan
    -0.15
    iverse
    -0.15
    iw
    -0.15
    less
    -0.14
     eigen
    -0.13
     possessions
    -0.13
     resources
    -0.13
    ric
    -0.13
     Meh
    -0.13
    POSITIVE LOGITS
    each
    0.18
     respective
    0.18
    (each
    0.16
     each
    0.16
    ê°ģ
    0.15
    ãĥ³ãĥij
    0.15
    ergy
    0.15
    ebek
    0.15
    Each
    0.14
     каждого
    0.14
    Act Density 0.058%

    No Known Activations