INDEX
    Explanations

    contexts related to social justice and advocacy

    New Auto-Interp
    Negative Logits
    986
    -0.14
     hallmark
    -0.14
    ubbo
    -0.14
    air
    -0.14
     çĶŁåij½åij¨æľŁ
    -0.13
     RPC
    -0.13
     dog
    -0.13
    oling
    -0.13
    ucker
    -0.13
    abad
    -0.13
    POSITIVE LOGITS
     source
    0.20
    source
    0.18
    δή
    0.16
    lude
    0.16
     indicator
    0.15
    æºIJ
    0.15
    tool
    0.14
    spring
    0.14
    -force
    0.14
     proof
    0.14
    Act Density 0.202%

    No Known Activations