INDEX
    Explanations

    words that indicate common practices or norms

    Negative Logits
    INC            -0.70
     Orchestra     -0.64
    amins          -0.64
    bern           -0.63
    eday           -0.63
     possibly      -0.63
     Vital         -0.61
    Wr             -0.60
     Kut           -0.60
    Posts          -0.59
    Positive Logits
    entimes        0.82
     consist       0.81
     consists      0.81
     comprise      0.76
     abbrevi       0.76
     consisted     0.74
    ãĤ©            0.74
     disclaim      0.74
     refers        0.73
     comprised     0.73
    Act Density 0.026%

    No Known Activations