INDEX
    Explanations

    punctuation and emotional expressions in text

    New Auto-Interp
    Negative Logits
    illez
    -0.16
    ÛĮ
    -0.15
    AREST
    -0.15
    ãĥ³ãĤ¸
    -0.14
    rief
    -0.14
    iest
    -0.14
    laces
    -0.14
    ambi
    -0.14
    vez
    -0.14
    اص
    -0.14
    POSITIVE LOGITS
    ãĥ³ãĤ°ãĥ«
    0.18
    retim
    0.16
    ither
    0.15
     ass
    0.15
    eo
    0.14
    yer
    0.14
     gian
    0.14
    Č
    0.14
    ux
    0.14
    ima
    0.13
    Act Density 0.018%

    No Known Activations