INDEX
    Explanations

    references to binaries and dichotomies, especially those related to gender and societal roles

    New Auto-Interp
    Negative Logits
    oldt
    -0.17
    olib
    -0.16
    327
    -0.14
    inu
    -0.14
    factory
    -0.14
     jack
    -0.14
    .ax
    -0.14
     factory
    -0.14
    itag
    -0.14
    aze
    -0.14
    POSITIVE LOGITS
    ساس
    0.19
    /Application
    0.15
    inputEmail
    0.14
    à¥įमà¤ķ
    0.14
    )prepare
    0.14
    é¤
    0.14
    updatedAt
    0.14
    _continuous
    0.14
    OSH
    0.14
    DataExchange
    0.14
    Act Density 0.179%

    No Known Activations