INDEX
    Explanations

    phrases related to safety and medical guidelines

    New Auto-Interp
    Negative Logits
    ValueStyle
    -0.59
    SerializedName
    -0.53
     tweaked
    -0.53
     pretty
    -0.52
    anskje
    -0.52
     Thankfully
    -0.51
     Interestingly
    -0.51
     just
    -0.51
     arguably
    -0.51
     hopefully
    -0.50
    POSITIVE LOGITS
     الرياضيه
    0.72
    ſelf
    0.69
     متعلقه
    0.67
     poichè
    0.64
    Never
    0.63
    Consult
    0.61
     Moslem
    0.61
    ेशा
    0.60
    awsze
    0.60
    ATTENTION
    0.60
    Act Density 0.170%

    No Known Activations