INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    rah
    -1.17
    RAH
    -0.82
    ItemLayout
    -0.80
    ār
    -0.79
     بيها
    -0.77
     незавершена
    -0.77
     propOrder
    -0.76
     disambiguazione
    -0.72
    TintMode
    -0.71
    رشف
    -0.70
    POSITIVE LOGITS
    #
    0.49
     Pim
    0.46
    0.45
    WebServlet
    0.45
    asa
    0.45
    es
    0.44
    vers
    0.44
     estekak
    0.44
     d
    0.44
     tục
    0.43
    Act Density 0.254%

    No Known Activations