    Explanations

    references to negative outcomes or quantitative losses

    instances of the word "losses" and related terms

    Negative Logits
    Token         Logit
    "Created"     -0.71
    "pol"         -0.66
    "dayName"     -0.66
    "JB"          -0.64
    "Fram"        -0.63
    "cart"        -0.63
    "ISTER"       -0.63
    "Offic"       -0.62
    "bara"        -0.61
    "pter"        -0.61
    Positive Logits
    Token           Logit
    " losses"       3.83
    " loss"         2.33
    "loss"          2.18
    " Loss"         2.13
    " defeats"      1.90
    " loses"        1.74
    " setbacks"     1.67
    " losers"       1.65
    " victories"    1.58
    " failures"     1.55
    Activation Density: 0.014%

    No Known Activations