INDEX
    Explanations

    warnings or alerts

    instances of the word "warning."

    New Auto-Interp
    Negative Logits
    morph
    -0.84
    dx
    -0.80
    animate
    -0.77
    rencies
    -0.73
    adr
    -0.73
    anova
    -0.73
    ophon
    -0.72
    rafted
    -0.72
    tiny
    -0.72
    growth
    -0.72
    POSITIVE LOGITS
     warning
    1.18
     Warning
    1.17
     warnings
    1.07
     Warn
    1.04
    warning
    1.03
     warn
    0.99
     warns
    0.95
     caution
    0.92
    warn
    0.91
     disclaimer
    0.85
    Act Density 0.018%

    No Known Activations