INDEX
    Explanations

    negative sentiment or criticism

    New Auto-Interp
    Negative Logits
     stoked
    -0.63
     Sultan
    -0.61
     lax
    -0.60
     convol
    -0.60
     ranks
    -0.59
    lement
    -0.57
    ISTER
    -0.56
     wedd
    -0.56
     Archdemon
    -0.55
     stagn
    -0.55
    POSITIVE LOGITS
    purpose
    0.80
    sight
    0.78
    task
    0.76
    oeuv
    0.76
    each
    0.76
    package
    0.76
    die
    0.75
    sent
    0.74
    street
    0.74
    one
    0.74
    Act Density 0.015%

    No Known Activations