INDEX
    Explanations

    negations indicating something is not what one might expect or assume

    phrases indicating negation or disapproval

    New Auto-Interp
    Negative Logits
    å¥
    -0.72
    ership
    -0.69
    éĥ
    -0.67
    éĹĺ
    -0.67
    PsyNetMessage
    -0.67
    iers
    -0.66
    Writer
    -0.65
     Tours
    -0.65
     Inventory
    -0.64
    luence
    -0.63
    POSITIVE LOGITS
     necessarily
    1.31
    icably
    1.21
    icable
    1.11
     exactly
    1.07
    orious
    1.02
    epad
    1.00
     yet
    0.98
    withstanding
    0.97
     entirely
    0.92
    eworthy
    0.92
    Act Density 0.187%

    No Known Activations