INDEX
    Explanations

    terms related to consequences and penalties for wrongdoings

    New Auto-Interp
    Negative Logits
    áºł
    -0.07
    SCO
    -0.07
    ucha
    -0.07
     nav
    -0.06
     somehow
    -0.06
    igham
    -0.06
    otor
    -0.06
    kili
    -0.06
    ONGO
    -0.06
    ighbors
    -0.06
    POSITIVE LOGITS
     permanently
    0.09
    δη
    0.08
     suspension
    0.07
     ban
    0.07
     permanent
    0.07
     temporarily
    0.07
    roker
    0.07
     perman
    0.07
     loss
    0.06
     Bans
    0.06
    Act Density 0.013%

    No Known Activations