INDEX
    Explanations

    mentions of the word "sex" along with related words

    references to sexual themes or concepts

    Negative Logits
    Token            Logit
    "ģĸ"             -0.76
    "BLIC"           -0.73
    " Hasan"         -0.69
    " Sawyer"        -0.68
    " landfall"      -0.68
    " Lafayette"     -0.67
    "Ĭ±"             -0.65
    " antioxid"      -0.63
    "Dispatch"       -0.62
    "£ı"             -0.62
    Positive Logits
    Token            Logit
    "ually"          1.09
    "iest"           1.02
    " trafficking"   0.97
    "ercise"         0.91
    "uality"         0.90
    "odus"           0.90
    "agen"           0.88
    "ier"            0.87
    " offender"      0.86
    " hormones"      0.85
    Activation Density: 0.033%

    No Known Activations