INDEX
    Explanations

    references to socio-economic disparity

    New Auto-Interp
    Negative Logits
     sometimes
    -0.15
     McGr
    -0.15
    ÂŃt
    -0.14
    ascript
    -0.14
     eg
    -0.14
    isay
    -0.14
     Binder
    -0.14
     Ledger
    -0.14
     overall
    -0.13
     selfish
    -0.13
    POSITIVE LOGITS
    uppet
    0.16
    alet
    0.16
     scarc
    0.15
     Wall
    0.15
    bern
    0.14
    wall
    0.14
    occo
    0.14
    REET
    0.14
    Wall
    0.14
    reet
    0.14
    Act Density 0.000%

    No Known Activations