INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    WillAppear
    -0.52
     <<<<<<<<<<<<<<
    -0.47
    idhi
    -0.47
     通販
    -0.45
    ))]
    -0.44
    modelBuilder
    -0.44
    ]]]
    -0.43
    ())))
    -0.42
    "]))
    -0.42
    "]]
    -0.41
    POSITIVE LOGITS
     للاسماء
    0.87
     Waray
    0.76
     المعيارى
    0.76
     mariée
    0.73
     ujednoznacz
    0.70
    PyExc
    0.69
    Jereo
    0.69
    Демографія
    0.68
    Discriminator
    0.68
     Audiodateien
    0.66
    Act Density 0.083%

    No Known Activations