INDEX
    Explanations

    instances of the word "therefore."

    New Auto-Interp
    Negative Logits
    ing
    -0.87
    er
    -0.68
    man
    -0.67
    hol
    -0.64
    yr
    -0.63
     ge
    -0.62
     Sha
    -0.58
    </em>
    -0.58
    dup
    -0.57
    frac
    -0.57
    POSITIVE LOGITS
     therefore
    2.36
     Therefore
    2.12
    Therefore
    2.08
    therefore
    2.04
     therefor
    1.85
    Поэтому
    1.56
     Daarom
    1.47
     Derfor
    1.46
     Portanto
    1.46
     Deshalb
    1.40
    Act Density 0.078%

    No Known Activations