INDEX
    Explanations

    occurrences of the words "each" and "every."

    New Auto-Interp
    Negative Logits
    anzi
    -0.16
    uss
    -0.15
    quets
    -0.15
    quent
    -0.14
    еÑĢÑĤа
    -0.14
     AREA
    -0.14
    ers
    -0.14
    ulo
    -0.14
    ulace
    -0.14
    ous
    -0.14
    POSITIVE LOGITS
     domic
    0.15
    .scalablytyped
    0.15
    oldem
    0.14
    ritis
    0.14
    aring
    0.14
     ç¤
    0.14
    .Theme
    0.13
    ENA
    0.13
    Hallo
    0.13
    erais
    0.13
    Act Density 0.035%

    No Known Activations