INDEX
    Explanations

    references to medication and health-related advice

    New Auto-Interp
    Negative Logits
    anja
    -0.16
    ORIA
    -0.15
    bid
    -0.15
    reta
    -0.15
    oria
    -0.15
    uffman
    -0.14
    ihan
    -0.14
    upon
    -0.14
    .finish
    -0.14
    ниÑĨ
    -0.14
    POSITIVE LOGITS
     your
    0.20
    your
    0.18
     Tip
    0.18
     ÑģÑĤа
    0.18
     you
    0.16
    Tip
    0.16
     yourself
    0.15
     tip
    0.15
     Synd
    0.15
    ä½łçļĦ
    0.15
    Act Density 0.267%

    No Known Activations