INDEX
    Explanations

    occurrences of specific characters or symbols, particularly parentheses and the ampersand symbol

    New Auto-Interp
    Negative Logits
     cih
    -0.15
    itchens
    -0.15
    .gov
    -0.14
    иÑģÑģ
    -0.14
    äºĮ人
    -0.14
    à¹ij
    -0.14
    anas
    -0.14
    harma
    -0.14
    allery
    -0.13
    /Branch
    -0.13
    POSITIVE LOGITS
    emie
    0.21
    prox
    0.16
    tsy
    0.16
     pul
    0.16
    elastic
    0.15
    infeld
    0.14
    acom
    0.14
     minds
    0.14
    grav
    0.14
    ONA
    0.14
    Act Density 0.005%

    No Known Activations