INDEX
    Explanations

    references to legal or law enforcement entities and related terms

    references to "woman" and "roman" in various contexts, suggesting a focus on gender-related themes or characters

    New Auto-Interp
    Negative Logits
    casting
    -0.77
    Reviewer
    -0.77
     Emin
    -0.72
    llular
    -0.68
    UV
    -0.67
    IRO
    -0.67
    REE
    -0.67
    rared
    -0.66
    maxwell
    -0.66
    IRD
    -0.65
    POSITIVE LOGITS
    ufact
    1.30
    agement
    1.09
    oman
    1.08
    ry
    0.84
    ipal
    0.82
    ation
    0.81
    agements
    0.81
    thal
    0.80
    WithNo
    0.79
    ovember
    0.78
    Act Density 0.011%

    No Known Activations