INDEX
    Explanations

    terms related to emotional distress and suffering

    New Auto-Interp
    Negative Logits
     distinct
    -0.17
    quis
    -0.16
    OrDefault
    -0.16
    itchens
    -0.16
    riv
    -0.16
    atty
    -0.15
     distint
    -0.14
    tdown
    -0.14
    ettel
    -0.14
    ark
    -0.14
    POSITIVE LOGITS
    ively
    0.26
    ingly
    0.22
    iveness
    0.21
    /dist
    0.18
    ritos
    0.17
    rella
    0.16
    ive
    0.16
    à¸Ń
    0.16
    inction
    0.15
    ycz
    0.15
    Act Density 0.020%

    No Known Activations