INDEX
    Explanations

    phrases that attribute blame or responsibility in interpersonal conflicts and actions

    New Auto-Interp
    Negative Logits
    alue
    -0.19
    ushima
    -0.16
     spis
    -0.16
    SystemService
    -0.15
    IColor
    -0.15
     luder
    -0.15
    ÑĨвеÑĤ
    -0.14
    outine
    -0.14
    ewater
    -0.14
    vida
    -0.14
    POSITIVE LOGITS
     enabling
    0.15
    war
    0.15
     supply
    0.15
    å¨
    0.15
     precip
    0.15
    self
    0.14
    adil
    0.14
    b
    0.14
     choices
    0.14
    bler
    0.14
    Act Density 0.234%

    No Known Activations