    Explanations

    occurrences of punctuation and formatting symbols in text

    Negative Logits
    zew
    -0.15
    ÑĪов
    -0.15
    enler
    -0.15
    EDI
    -0.14
    ãĥ¥ãĥ¼
    -0.14
    IndexPath
    -0.14
    .addObject
    -0.14
    оди
    -0.14
    šk
    -0.13
    аÑĢÑĩ
    -0.13
    Positive Logits
     Sab
    0.16
     New
    0.15
    izz
    0.15
    ido
    0.15
     
    0.14
    Âł
    0.14
     /
    0.14
    ib
    0.14
    emic
    0.14
    998
    0.14
    Act Density 0.041%

    No Known Activations