    Explanations

    phrases indicating social interactions and individual relationships

    language names and technical terms

    Negative Logits
    "ium"             -0.29
    " кӀ"             -0.26
    " yılında"        -0.25
    " pernas"         -0.25
    "featureID"       -0.25
    "oredCriteria"    -0.24
    " удалось"        -0.24
    " diper"          -0.24
    "новниш"          -0.24
    "Superficie"      -0.24
    Positive Logits
    "adaptiveStyles"  0.65
    "haikusbot"       0.64
    "новништво"       0.63
    "MLLoader"        0.59
    "httphttps"       0.58
    "iſchen"          0.58
    "niſſe"           0.56
    "iſche"           0.55
    " Disqus"         0.54
    " translators"    0.53
    Activation Density: 0.075%

    No Known Activations