INDEX
    Explanations

    phrases indicating negation or contradiction

    phrases that negate or dismiss certain ideas or concepts

    New Auto-Interp
    Negative Logits
    Reviewed
    -0.71
     Metatron
    -0.67
     whichever
    -0.57
     Polk
    -0.57
    GBT
    -0.56
     Orth
    -0.55
     Stew
    -0.54
    perse
    -0.54
    kefeller
    -0.53
    phrine
    -0.53
    POSITIVE LOGITS
    xious
    0.90
     uncertain
    0.90
    onday
    0.89
    xus
    0.87
    ct
    0.78
     conceivable
    0.78
     avail
    0.74
     particular
    0.73
    osphere
    0.72
    otrop
    0.70
    Act Density 0.028%

    No Known Activations