INDEX
    Explanations

    negative statements or negations

    negations and the phrase "isn't" in various contexts

    New Auto-Interp
    Negative Logits
    tein
    -0.74
    gnu
    -0.66
     creations
    -0.60
     Properties
    -0.58
     endeavors
    -0.58
     Supported
    -0.57
    doms
    -0.55
     WARN
    -0.54
     pursuits
    -0.54
    ritch
    -0.53
    POSITIVE LOGITS
    hin
    1.01
    ibaba
    1.00
     anybody
    1.00
     enough
    0.99
     unanim
    0.98
     anyone
    0.98
     anything
    0.97
     any
    0.96
    enough
    0.93
     room
    0.90
    Act Density 0.079%

    No Known Activations