INDEX
    Explanations

    punctuation and formatting elements in the text

    New Auto-Interp
    Negative Logits
     Tur
    -0.61
    Tur
    -0.59
     Crus
    -0.52
    𝙫
    -0.51
     היש
    -0.48
    livejournal
    -0.48
     villaggio
    -0.48
    inės
    -0.47
     Ter
    -0.46
    Kết
    -0.46
    POSITIVE LOGITS
    .$,
    1.34
    ,:),
    1.30
    ,-,
    1.27
    (",",
    1.26
    ,',
    1.26
    *,
    1.24
    ,",
    1.23
    ,,,
    1.23
    €,
    1.23
     {,
    1.22
    Act Density 3.925%

    No Known Activations