INDEX
    Explanations

    warnings or alerts expressed in various contexts

    instances of the word "warned" and its variations

    Negative Logits
    "Interstitial"  -0.73
    "anuts"         -0.66
    "animate"       -0.66
    " ILCS"         -0.66
    "á"             -0.64
    "dx"            -0.63
    "portion"       -0.61
    "NAS"           -0.61
    "rafted"        -0.60
    "adesh"         -0.59
    Positive Logits
    "warn"       1.04
    "warning"    0.95
    " Warn"      0.89
    " omin"      0.86
    " warnings"  0.82
    "ingly"      0.81
    " warns"     0.80
    " caution"   0.78
    " warn"      0.77
    " warning"   0.74
    Act Density 0.018%

    No Known Activations