INDEX
    Explanations

    references to nudity and sexual actions

    New Auto-Interp
    Negative Logits
     препратки
    -0.76
    DockStyle
    -0.72
    +:+
    -0.71
    KURZBESCHREIBUNG
    -0.69
    twimg
    -0.69
    ScopeManager
    -0.65
    setVerticalGroup
    -0.64
     Roskov
    -0.64
    homonymie
    -0.63
    oznam
    -0.62
    POSITIVE LOGITS
     naked
    1.63
     nude
    1.40
    naked
    1.40
     nudity
    1.37
     Naked
    1.33
     bare
    1.22
    Naked
    1.22
     exposing
    1.19
     desn
    1.18
    1.14
    Act Density 0.150%

    No Known Activations