    Explanations

    discussions related to teaching, guidance, and supportive relationships

    Negative Logits
    Token         Logit
    " we"         -1.28
    " We"         -1.07
    "We"          -0.96
    " me"         -0.95
    " I"          -0.88
    "we"          -0.85
    " мы"         -0.77
    " us"         -0.73
    " я"          -0.70
    " WE"         -0.70

    Positive Logits
    Token         Logit
    " our"        1.75
    " my"         1.51
    " nuestros"   1.31
    "our"         1.29
    " nossos"     1.28
    "Our"         1.20
    " nosso"      1.16
    "my"          1.14
    "我的"        1.14
    " nuestro"    1.13
    Activation Density: 0.564%

    No Known Activations