INDEX
    Explanations

    phrases related to comparisons or contrasting statements

    punctuation marks, particularly commas

    New Auto-Interp
    Negative Logits
    ãĥ¥
    -0.75
    -,
    -0.69
    arily
    -0.63
    ãĤ´
    -0.63
    Detailed
    -0.63
    MpServer
    -0.62
    ,...
    -0.62
    ļéĨĴ
    -0.60
    inar
    -0.60
    Includes
    -0.60
    POSITIVE LOGITS
     however
    1.20
     though
    1.04
     therefore
    0.88
     it
    0.87
     although
    0.86
     we
    0.82
     moreover
    0.80
     yes
    0.78
     there
    0.78
     they
    0.76
    Act Density 0.253%

    No Known Activations