INDEX
    Explanations

    phrases indicating a cause or reason for a particular outcome

    phrases indicating causality or consequences

    New Auto-Interp
    Negative Logits
     reinvent
    -0.71
     quietly
    -0.69
     sack
    -0.67
     electr
    -0.65
     tack
    -0.63
     dwell
    -0.62
     winning
    -0.61
     sinking
    -0.60
    horn
    -0.60
     sprint
    -0.60
    POSITIVE LOGITS
     Due
    3.36
    Due
    2.46
    due
    1.63
     Upon
    1.29
     Depending
    1.28
     due
    1.26
     Because
    1.22
     Since
    1.20
     Unfortunately
    1.17
     Prior
    1.16
    Act Density 0.015%

    No Known Activations