INDEX
    Explanations

    emotionally charged adjectives

    New Auto-Interp
    Negative Logits
     cycle
    -0.67
     GEAR
    -0.63
     waivers
    -0.62
    Origin
    -0.61
     minors
    -0.61
     RI
    -0.59
     Korea
    -0.58
     ok
    -0.57
     ALSO
    -0.57
     damages
    -0.56
    POSITIVE LOGITS
    acious
    1.10
    ignant
    0.97
    entious
    0.96
    arious
    0.93
    uous
    0.91
    romising
    0.91
    oried
    0.89
    ious
    0.89
    worldly
    0.87
    ithering
    0.87
    Act Density 0.268%

    No Known Activations