INDEX
    Explanations

    special characters and punctuation in the text

    New Auto-Interp
    Negative Logits
    <bos>
    -0.54
    valt
    -0.50
     Vale
    -0.49
     Kardinal
    -0.47
    >{"
    -0.46
    piecze
    -0.46
     bens
    -0.46
    `),
    -0.46
     AppCompatTheme
    -0.46
    ioterapia
    -0.45
    POSITIVE LOGITS
    :✨
    0.81
    tvguidetime
    0.81
     تانيه
    0.80
    rungsseite
    0.74
    tagHelperRunner
    0.70
    +#+#
    0.69
    fjspx
    0.68
     متعلقه
    0.68
     snippetHide
    0.68
    الحياه
    0.65
    Act Density 0.024%

    No Known Activations