INDEX
    Explanations
    Negative Logits
    اط
    -0.08
     nag
    -0.08
    nag
    -0.07
     abb
    -0.07
     mins
    -0.07
     cognitive
    -0.07
     οι
    -0.07
     teg
    -0.07
    -0.07
     Isabelle
    -0.07
    Positive Logits
    ಂಧ
    0.08
     hospitalized
    0.08
    0.08
     gasolina
    0.08
    0.07
     سيارة
    0.07
    Lugar
    0.07
    ('/',
    0.07
     Мен
    0.07
     miest
    0.07
    Activation Density: 0.005%

    No Known Activations