INDEX
    Explanations

    themes of competition and achievement

    New Auto-Interp
    Negative Logits
    ıza
    -0.13
    utdown
    -0.12
     нÑĸÑı
    -0.11
    ÑıÑĤи
    -0.11
    inci
    -0.11
    λικά
    -0.11
    еÑĦ
    -0.11
    irement
    -0.11
    άβ
    -0.10
    udad
    -0.10
    POSITIVE LOGITS
     one
    1.45
    one
    0.95
     ONE
    0.89
    _one
    0.89
    -one
    0.85
     One
    0.85
    One
    0.83
    .one
    0.82
     uno
    0.80
     одного
    0.75
    Act Density 2.746%

    No Known Activations