INDEX
    Explanations

    phrases related to giving advice or warnings

    the contraction "don't."

    New Auto-Interp
    Negative Logits
     Penguin
    -0.73
     Reloaded
    -0.73
     SetTextColor
    -0.73
    ĪĴ
    -0.71
     Colour
    -0.71
     Radiation
    -0.69
    £ı
    -0.69
     Defin
    -0.68
     Dangerous
    -0.66
     Positive
    -0.66
    POSITIVE LOGITS
     necessarily
    0.92
    cha
    0.89
    ween
    0.88
    ting
    0.82
    ional
    0.80
     anymore
    0.80
    aper
    0.80
    ardless
    0.79
     know
    0.79
    rave
    0.78
    Act Density 0.081%

    No Known Activations