INDEX
    Explanations

    phrases that promote critical thinking and discussion around societal issues

    Negative Logits
     yourself
    -0.19
     उसन
    -0.19
     خودش
    -0.15
    imler
    -0.15
     그는
    -0.14
     آن
    -0.14
     itself
    -0.14
     nó
    -0.14
     ihtiyac
    -0.14
    oretical
    -0.13
    Positive Logits
     their
    1.39
    their
    1.23
    Their
    1.16
     Their
    1.16
     THEIR
    1.02
     их
    1.01
     jejich
    0.95
     leurs
    0.94
     loro
    0.93
     leur
    0.93
    Act Density 3.450%

    No Known Activations