INDEX
    Explanations

    references to blame and responsibility

    New Auto-Interp
    Negative Logits
    SequentialGroup
    -0.39
    obacz
    -0.38
    slidesPer
    -0.35
    retario
    -0.35
     reseña
    -0.34
     retos
    -0.34
    </thead>
    -0.33
    ().__
    -0.33
     lauk
    -0.33
    ButtonClicked
    -0.32
    POSITIVE LOGITS
     blame
    1.39
    blame
    1.30
     Blame
    1.21
    Blame
    1.18
     blamed
    1.17
     blames
    1.11
     blaming
    1.06
     attribution
    0.82
     culpa
    0.81
     fault
    0.77
    Act Density 0.642%

    No Known Activations