INDEX
    Explanations

    statements about social justice and equity

    New Auto-Interp
    Negative Logits
    fur
    -0.15
    avana
    -0.14
    åIJį
    -0.14
     Shuffle
    -0.13
    eka
    -0.13
    ediator
    -0.13
    ä¿ĿæĮģ
    -0.13
     Relax
    -0.13
     clin
    -0.13
    489
    -0.13
    POSITIVE LOGITS
     instead
    0.24
    instead
    0.21
     Instead
    0.19
     tomorrow
    0.19
    Instead
    0.19
     sustainable
    0.17
     better
    0.16
     truly
    0.16
     based
    0.15
     alternatives
    0.15
    Act Density 0.470%

    No Known Activations