INDEX
    Explanations

    statements regarding organizations' missions and goals

    New Auto-Interp
    Negative Logits
    ertain
    -0.15
    xd
    -0.15
    744
    -0.14
    erence
    -0.14
    [from
    -0.14
    ÙIJÙħ
    -0.13
    емÑĥ
    -0.13
    thon
    -0.13
    istrovstvÃŃ
    -0.13
    taire
    -0.13
    POSITIVE LOGITS
     tw
    0.43
     simple
    0.31
     Tw
    0.23
     three
    0.22
     straightforward
    0.22
    simple
    0.21
     clear
    0.21
    ç®Ģåįķ
    0.21
     dual
    0.20
    _tw
    0.20
    Act Density 0.056%

    No Known Activations