    Explanations

    the word "not" in sentences

    Negative Logits
    Token             Logit
    "Nice"            -0.68
    "OSP"             -0.63
    "å¥"              -0.63
    " Measures"       -0.61
    "Kings"           -0.60
    "Intern"          -0.60
    " Seasons"        -0.59
    "Nine"            -0.59
    " assessments"    -0.59
    "IDENT"           -0.58
    Positive Logits
    Token             Logit
    " necessarily"    1.31
    " tolerate"       1.16
    "icably"          1.11
    " hesitate"       1.07
    " relent"         1.01
    "ogle"            1.00
    " be"             0.99
    " allow"          0.97
    "hin"             0.97
    " bud"            0.94
    Activation Density: 0.071%

    No Known Activations