INDEX
    Explanations

    strongly worded expressions or emphasis, such as 'damn' or 'darn'

    expressions of strong frustration or emphasis

    New Auto-Interp
    Negative Logits
    NetMessage
    -1.10
    KY
    -0.76
    ramid
    -0.75
    Interstitial
    -0.75
    KER
    -0.72
    CRE
    -0.71
    idon
    -0.70
     membr
    -0.68
    cn
    -0.68
    chn
    -0.68
    POSITIVE LOGITS
     darn
    0.86
     damn
    0.83
    selves
    0.83
    ibly
    0.79
     damned
    0.76
    holes
    0.73
    ation
    0.72
     kidding
    0.71
    wit
    0.71
    nuts
    0.70
    Act Density 0.018%

    No Known Activations