INDEX
    Explanations

    descriptions of physical appearances and qualities

    Negative Logits
    apunov          -0.67
     saya           -0.66
    "!              -0.61
     diyor          -0.59
    ”!              -0.58
    !!!”            -0.58
     recomiendo     -0.57
    !".             -0.57
    !!!"            -0.57
     citoyens       -0.54
    Positive Logits
     hadn           0.81
    دانشنامهٔ       0.69
     goddamn        0.65
    Personensuche   0.64
    webElement      0.64
     fucking        0.64
     ivelany        0.63
     seventeen      0.62
     practically    0.61
                    0.61
    Act Density 0.355%

    No Known Activations