INDEX
    Explanations
    Negative Logits
     Yandex             -0.47
     Nebu               -0.45
     resourceCulture    -0.44
     scă                -0.44
     miniaturka         -0.42
     Cinem              -0.42
     незавершена        -0.42
    enyum               -0.41
    يع                  -0.41
     étoit              -0.41
    Positive Logits
     apart              1.99
    apart               1.75
     Apart              1.63
    Apart               1.51
     APART              1.23
     aparte             1.10
     aside              0.95
     appart             0.88
    aside               0.86
     asunder            0.86
    Activation Density: 0.005%

    No Known Activations