INDEX
    Explanations

    causal relationships or reasoning in text through the use of the word "hence."

    the word "hence" used in various contexts

    New Auto-Interp
    Negative Logits
     Tasman
    -0.63
     hitter
    -0.62
    abies
    -0.61
    estation
    -0.59
     batter
    -0.59
    Bull
    -0.58
     Bastard
    -0.58
    >>>>
    -0.58
     battered
    -0.57
    Fram
    -0.57
    POSITIVE LOGITS
    forth
    1.92
    forward
    1.37
    entimes
    0.82
    far
    0.79
    apy
    0.78
    comings
    0.77
    pend
    0.77
    hua
    0.75
    apers
    0.75
    noon
    0.73
    Act Density 0.009%

    No Known Activations