INDEX
    Explanations

    instances of the word "women"

    references to women and their roles or experiences in various contexts

    New Auto-Interp
    Negative Logits
     constitu
    -0.79
     heterogeneity
    -0.68
    arta
    -0.66
    ologically
    -0.64
     Keynes
    -0.64
     statist
    -0.63
    illin
    -0.63
    tics
    -0.62
    kson
    -0.62
    HOME
    -0.61
    POSITIVE LOGITS
    volent
    0.96
    wagen
    0.85
    uthor
    0.74
    istries
    0.74
    pher
    0.71
    ibly
    0.70
    ager
    0.68
    geoning
    0.68
     IMAGES
    0.67
     SUV
    0.64
    Act Density 0.128%

    No Known Activations