INDEX
    Explanations

    sentences expressing emotions and intentions related to sharing, hope, and connection

    New Auto-Interp
    Negative Logits
     yourself
    -1.04
    yourself
    -0.84
     Yourself
    -0.80
     YOURSELF
    -0.79
     jezelf
    -0.66
    Yourself
    -0.62
     me
    -0.62
    mine
    -0.59
     your
    -0.58
     comigo
    -0.57
    POSITIVE LOGITS
     vocês
    1.09
     you
    1.07
     ustedes
    1.03
     yall
    0.94
     متعلقه
    0.81
     jullie
    0.81
     تكبرها
    0.78
     vosotros
    0.76
     виправивши
    0.75
     thee
    0.74
    Act Density 0.234%

    No Known Activations