INDEX
    Explanations

    own / self / possession

    Negative Logits
     budete  0.35
     Ihrer  0.30
    elijke  0.29
    )。  0.29
     तुम्ही  0.29
     ਤੁਹਾ  0.29
     તમારી  0.29
    你可以  0.28
     вашей  0.28
    ת  0.28
    Positive Logits
     own  0.41
     próprio  0.32
    selves  0.31
     respective  0.30
     próprias  0.29
     propio  0.28
     propres  0.26
     propios  0.26
     namesake  0.26
     próprios  0.26
    Act Density 0.463%

    No Known Activations