INDEX
    Explanations

    references to gender inequality and discrimination in political contexts

    New Auto-Interp
    Negative Logits
    aler
    -0.15
    abin
    -0.15
    .Companion
    -0.15
    inalg
    -0.14
     via
    -0.14
    aggio
    -0.14
     ExecutionContext
    -0.14
    inish
    -0.13
    owler
    -0.13
    oller
    -0.13
    POSITIVE LOGITS
    Flip
    0.17
     flip
    0.17
    flip
    0.16
     Flip
    0.16
    763
    0.16
    lys
    0.14
    gross
    0.14
    íݸ
    0.14
     daher
    0.14
    imp
    0.13
    Act Density 0.660%

    No Known Activations