INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    IntoConstraints
    -0.69
     <<<<<<<<<<<<<<
    -0.63
     Myster
    -0.62
     noqa
    -0.56
    UserScript
    -0.53
     surla
    -0.52
    fxml
    -0.52
     proxim
    -0.52
     bè
    -0.52
     PyLong
    -0.52
    POSITIVE LOGITS
    persons
    0.61
    دانشنامهٔ
    0.60
    ьаж
    0.51
    ítulos
    0.50
    woman
    0.48
    toThrow
    0.46
    men
    0.45
    women
    0.45
    outine
    0.45
    erapeutic
    0.44
    Act Density 0.877%

    No Known Activations