INDEX
    Explanations

    phrases related to giving warnings or alerts

    instances of the word "warned" and its variations

    New Auto-Interp
    Negative Logits
    anuts
    -0.77
    Interstitial
    -0.74
     ILCS
    -0.74
    mite
    -0.68
    nan
    -0.67
    animate
    -0.66
    OVA
    -0.66
    á
    -0.66
    aredevil
    -0.64
    adesh
    -0.63
    POSITIVE LOGITS
     against
    0.98
    warn
    0.96
     omin
    0.96
    warning
    0.90
     caution
    0.84
     Warn
    0.81
     listeners
    0.78
    ingly
    0.78
     warnings
    0.75
     us
    0.75
    Act Density 0.025%

    No Known Activations