INDEX
    Explanations

    phrases related to causation or prediction of outcomes

    phrases indicating causality or outcomes

    New Auto-Interp
    Negative Logits
    oqu
    -0.65
    yd
    -0.63
    pmwiki
    -0.62
    Cas
    -0.59
    advertisement
    -0.59
    kus
    -0.59
    ascus
    -0.59
     Unknown
    -0.57
     Bastard
    -0.57
    Pic
    -0.57
    POSITIVE LOGITS
     someday
    0.86
    enance
    0.81
    geries
    0.81
    igate
    0.78
     tomorrow
    0.72
    gery
    0.71
    rued
    0.69
    lessly
    0.67
     some
    0.67
     sooner
    0.66
    Act Density 0.321%

    No Known Activations