INDEX
    Explanations

    words related to medical conditions, treatments, and procedures that may involve potential risk or harm

    New Auto-Interp
    Negative Logits
     McGee
    -0.66
    Ô
    -0.63
     bye
    -0.61
     Skinner
    -0.59
    Browser
    -0.58
     Journalists
    -0.58
     Kimber
    -0.57
     McCabe
    -0.56
     Ri
    -0.55
     Fri
    -0.55
    POSITIVE LOGITS
    rophic
    1.16
    rophe
    0.92
    anship
    0.85
    roph
    0.81
    asis
    0.80
    atic
    0.79
    otypes
    0.78
    ucle
    0.77
    oration
    0.77
    istics
    0.77
    Act Density 0.049%

    No Known Activations