INDEX
    Explanations

    negative sentiments or expressions of dislike

    New Auto-Interp
    Negative Logits
    elles
    -0.16
    enha
    -0.15
    erland
    -0.15
    575
    -0.14
    aux
    -0.14
    /template
    -0.14
    ulk
    -0.14
    ãĤ«ãĥ¼
    -0.14
    .tk
    -0.14
    675
    -0.13
    POSITIVE LOGITS
    enez
    0.14
     suspense
    0.14
    furt
    0.14
    ема
    0.14
    bower
    0.14
     touch
    0.14
     Surprise
    0.14
    奴
    0.13
    touch
    0.13
    jay
    0.13
    Act Density 0.083%

    No Known Activations