INDEX
    Explanations

    the presence of special characters and formatting within the text

    New Auto-Interp
    Negative Logits
    kaar
    -0.16
    ynos
    -0.15
     ì½ĺ
    -0.14
    ario
    -0.14
    -END
    -0.14
    imli
    -0.14
    گذ
    -0.14
    ارÙģ
    -0.14
    rego
    -0.14
    zyst
    -0.14
    POSITIVE LOGITS
    eyse
    0.17
    MG
    0.15
    hn
    0.15
    æķħ
    0.14
     Morgan
    0.14
    ToWorld
    0.14
    entr
    0.14
    .FontStyle
    0.14
    raÄį
    0.14
     Universal
    0.14
    Act Density 0.005%

    No Known Activations