INDEX
    Explanations

    expressions of emotional distress and concern for victims

    New Auto-Interp
    Negative Logits
    ÙĪÙĦÙĬÙĪ
    -0.08
    .Interop
    -0.08
    .Magenta
    -0.07
    à¥įतर
    -0.07
    _cre
    -0.07
    ddit
    -0.07
    еÑĢп
    -0.07
    _KP
    -0.07
    _VENDOR
    -0.07
    .BorderFactory
    -0.07
    POSITIVE LOGITS
    affe
    0.07
     Wit
    0.06
     Kot
    0.06
    apiro
    0.06
     tran
    0.06
    gil
    0.06
    eme
    0.06
    uat
    0.06
     âĸ²
    0.05
    è½
    0.05
    Act Density 0.002%

    No Known Activations