INDEX
    Explanations

    terms related to sexual conduct and abuse

    Negative Logits
    Token             Logit
    " Passage"        -0.52
    " fusca"          -0.48
    " impression"     -0.47
    "Сылтамалар"      -0.46
    "Passage"         -0.45
    "一片"            -0.45
    " Markham"        -0.45
    " commitments"    -0.45
    "gentes"          -0.44
    "standers"        -0.44
    Positive Logits
    Token             Logit
    "Sex"             1.01
    "Sexual"          0.98
    " Sex"            0.95
    " sex"            0.94
    " Sexual"         0.93
    " sexual"         0.93
    " sexuales"       0.92
    " SEX"            0.91
    "sex"             0.87
    " sexuelle"       0.87
    Activation Density: 0.027%

    No Known Activations