INDEX
    Explanations

    references to various forms of sexual violence and related crimes

    New Auto-Interp
    Negative Logits
     Ung
    -0.15
    ieber
    -0.15
     Fatal
    -0.14
    _agents
    -0.14
    .annot
    -0.14
    weg
    -0.14
     Deadly
    -0.13
    agenta
    -0.13
    ispiel
    -0.13
    ico
    -0.13
    POSITIVE LOGITS
    hower
    0.15
    ấn
    0.15
    FIT
    0.15
    eding
    0.15
    ;base
    0.14
    à¤Ĥà¤ĸ
    0.14
    orz
    0.14
    kili
    0.14
    setFlash
    0.14
     STA
    0.14
    Act Density 0.026%

    No Known Activations