INDEX
    Explanations

    reflections on personal mistakes and accountability for one's actions

    New Auto-Interp
    Negative Logits
    ecz
    -0.08
     addCriterion
    -0.08
    lopen
    -0.07
    ndo
    -0.07
    loat
    -0.07
    uib
    -0.07
    uros
    -0.07
    izi
    -0.07
    Keywords
    -0.07
    vit
    -0.07
    POSITIVE LOGITS
     drugs
    0.07
     rational
    0.07
     actions
    0.07
     Rational
    0.07
     acting
    0.06
     behavior
    0.06
     drug
    0.06
     role
    0.06
     choices
    0.06
     part
    0.06
    Act Density 0.030%

    No Known Activations