    Explanations

    phrases that express intention or purpose

    Negative Logits
    " виправивши"        -0.59
    "šinou"              -0.57
    " conmigo"           -0.56
    " officielles"       -0.54
    " كورة"              -0.53
    "almaz"              -0.53
    " vanligt"           -0.52
    " chofe"             -0.52
    " forskj"            -0.52
    "AddHtmlAttribute"   -0.52

    Positive Logits
    " чтобы"             0.91
    " afin"              0.89
    " כדי"               0.88
    " Để"                0.86
    "Để"                 0.84
    "为了"               0.83
    " upang"             0.82
    "ůli"                0.81
    "為了"               0.80
    " щоб"               0.79
    Activation Density: 0.159%

    No Known Activations