    Negative Logits
    "共同"  -0.11
    " নিজেদের"  -0.10
    " juntos"  -0.10
    " gemeinsamen"  -0.09
    " collectively"  -0.09
    " gezamen"  -0.09
    " gemeinsam"  -0.09
    " jointly"  -0.09
    "一起"  -0.09
    " collaboratively"  -0.09
    Positive Logits
    " తన"  0.10
    " തന്റെ"  0.09
    " sozinho"  0.09
    " solo"  0.08
    " thesis"  0.08
    " himself"  0.08
    " herself"  0.07
    "afx"  0.07
    " pragma"  0.07
    " jezelf"  0.07
    Activation Density: 0.025%

    No Known Activations