INDEX
    Explanations

    instances of punctuation or breaks in text, particularly periods

    New Auto-Interp
    Negative Logits
    ulus
    -0.14
    mine
    -0.14
    odel
    -0.14
    zim
    -0.14
    resentation
    -0.14
    addtogroup
    -0.14
     dara
    -0.14
     Alley
    -0.14
    usat
    -0.14
    ализа
    -0.14
    POSITIVE LOGITS
    461
    0.16
    aus
    0.15
    owitz
    0.15
     wards
    0.14
    å¹¹
    0.14
    лаÑĤи
    0.14
    اÙĨÙĩ
    0.14
    ichel
    0.14
     Stanton
    0.14
    ãĤ¹ãĥŀ
    0.14
    Act Density 0.002%

    No Known Activations