    Negative Logits
     habet            -0.45
     malaise          -0.40
    licherweise       -0.38
    uta               -0.34
     Chá              -0.34
     femen            -0.33
                      -0.33
     forever          -0.33
    usti              -0.32
     übere            -0.32
    Positive Logits
    ImageContext      1.04
     AssemblyCulture  1.03
     مشين             0.93
    OGND              0.91
    findpost          0.81
    abestanden        0.81
     Wicidata         0.81
     tartalomajánló   0.81
    TagMode           0.79
     bezeichneter     0.79
    Activation Density: 0.419%

    No Known Activations