INDEX
    Explanations

    direct quotations

    quotation marks or speech indicators in the text

    Negative Logits
     accomp               -0.85
     carrier              -0.84
     removable            -0.82
     cleanup              -0.81
     adjud                -0.78
     disproportionately   -0.77
     cubic                -0.76
     dominate             -0.75
     flared               -0.75
     replacement          -0.75

    Positive Logits
    We                     1.46
    Absolutely             1.44
    There                  1.42
    I                      1.41
    Whoever                1.40
    It                     1.38
    If                     1.36
    Obviously              1.36
    You                    1.35
    Personally             1.34

    Act Density 0.126%

    No Known Activations