INDEX
    Explanations

    thoughts and reflections on identity and societal perceptions

    Negative Logits
     TSR
    -0.07
    uw
    -0.07
    orias
    -0.07
    /GPL
    -0.06
    valuator
    -0.06
     queryInterface
    -0.06
    iversite
    -0.06
    rlen
    -0.06
     overt
    -0.06
    塚
    -0.06
    Positive Logits
     nor
    0.15
     Nor
    0.12
     Nope
    0.10
     sondern
    0.10
    Nor
    0.10
    nor
    0.10
     sino
    0.09
    む
    0.08
     بلکه
    0.07
    那样
    0.07
    Activation Density: 0.024%

    No Known Activations