INDEX
    Explanations

    punctuation marks, particularly periods at the end of sentences

    New Auto-Interp
    Negative Logits
     Nug
    -0.15
    phet
    -0.15
    anche
    -0.15
     navr
    -0.15
    asca
    -0.14
    affe
    -0.13
    .fm
    -0.13
    alker
    -0.13
    .interface
    -0.13
    unker
    -0.13
    POSITIVE LOGITS
    omat
    0.17
    ucc
    0.16
    ıa
    0.15
    uto
    0.15
    ús
    0.14
    ugins
    0.14
    _gap
    0.14
    SourceType
    0.14
    oci
    0.14
    urb
    0.14
    Act Density 0.000%

    No Known Activations