INDEX
    Explanations

    references to community responsibilities and interactions

    New Auto-Interp
    Negative Logits
    让æĪij
    -0.17
     themselves
    -0.17
     Yourself
    -0.15
    лива
    -0.14
    šlo
    -0.14
     دارÙħ
    -0.14
    larım
    -0.14
     itself
    -0.14
     himself
    -0.13
     мне
    -0.13
    POSITIVE LOGITS
     our
    1.41
    our
    1.06
    æĪij们çļĦ
    1.04
     ourselves
    0.97
     ours
    0.96
     OUR
    0.93
     nosso
    0.89
    Our
    0.85
     nossa
    0.85
     Our
    0.84
    Act Density 1.838%

    No Known Activations