    Explanations

    phrases with a negative connotation, particularly focusing on the word "no"

    negations or phrases indicating absence or denial

    Negative Logits
    Token           Logit
    "rn"            -0.80
    "often"         -0.72
    "inarily"       -0.71
    "ellect"        -0.68
    "alian"         -0.67
    "ahime"         -0.67
    "deck"          -0.66
    "schild"        -0.66
    " WATCHED"      -0.66
    "typically"     -0.65
    Positive Logits
    Token               Logit
    " exceptions"       1.11
    " longer"           0.99
    "xious"             0.97
    " modifications"    0.95
    " alteration"       0.93
    " refunds"          0.91
    " compromises"      0.90
    " restrictions"     0.89
    " surprises"        0.89
    " additional"       0.88
    Activation Density: 0.106%

    No Known Activations