INDEX
    Explanations

    references to social justice and community issues

    New Auto-Interp
    Negative Logits
    ovit
    -0.15
    edo
    -0.14
    cales
    -0.14
    §
    -0.14
    iers
    -0.14
    .appspot
    -0.14
    704
    -0.13
     appe
    -0.13
     Naval
    -0.13
     Lans
    -0.13
    POSITIVE LOGITS
     dignity
    0.16
    istrat
    0.15
    RestController
    0.15
    illac
    0.15
    orges
    0.15
    .proc
    0.15
     Ñĩего
    0.14
     Plantae
    0.14
     dign
    0.14
    akens
    0.14
    Act Density 0.009%

    No Known Activations