INDEX
    Explanations

    the word "Not" indicating negation

    the phrase "Not" followed by various contexts indicating exceptions or disclaimers

    New Auto-Interp
    Negative Logits
    éĥ
    -0.76
    ç·
    -0.76
    éģ
    -0.72
    stakes
    -0.71
    kamp
    -0.71
    ãģ®ç
    -0.70
    æ©
    -0.69
    creen
    -0.69
    大
    -0.69
     Mehran
    -0.67
    POSITIVE LOGITS
    epad
    1.21
    withstanding
    1.16
    orious
    1.10
    icably
    1.07
     necessarily
    1.04
    eworthy
    0.98
    ices
    0.93
    ific
    0.90
    ifications
    0.89
    icia
    0.86
    Act Density 0.063%

    No Known Activations