    Explanations

    positive adjectives

    Negative Logits
     mêmes            -0.50
     autorytatywna    -0.47
     contenus         -0.45
     conseguenza      -0.43
    новништво         -0.42
    Ante              -0.42
     aikaa            -0.41
     aussieht         -0.41
     вещей            -0.41
    Ad                -0.40
    Positive Logits
     are              0.76
    >{@               0.74
     ويكيميديا        0.73
     externi          0.69
     they             0.69
    DrawerToggle      0.68
     were             0.65
     allAfrica        0.63
    UnusedPrivate     0.60
     HttpHeaders      0.59
    Activation Density: 0.001%

    No Known Activations