INDEX
    Explanations

    phrases related to a high level of completeness or uniformity

    instances of the word "completely" to emphasize totality or absoluteness

    New Auto-Interp
    Negative Logits
    maid
    -0.86
    liest
    -0.71
    pires
    -0.70
    rers
    -0.69
    llor
    -0.68
    ĺħ
    -0.67
    hao
    -0.66
    AMY
    -0.66
    yip
    -0.66
    pring
    -0.65
    POSITIVE LOGITS
    BuyableInstoreAndOnline
    0.77
     reliant
    0.73
     disreg
    0.72
     unrelated
    0.72
     obliter
    0.71
     accomplished
    0.69
     depends
    0.68
     annihil
    0.68
     rewritten
    0.67
     overhaul
    0.67
    Act Density 0.020%

    No Known Activations