INDEX
    Explanations

    proper nouns, specifically names of people and organizations

    New Auto-Interp
    Negative Logits
     محفوظة
    -0.58
    enumii
    -0.53
    databinding
    -0.51
    PreferredItem
    -0.51
     ویکی‌پدیا
    -0.50
    وحة
    -0.50
    GetBytes
    -0.50
    Geplaatst
    -0.49
    myapplication
    -0.48
    enumi
    -0.48
    POSITIVE LOGITS
     Numerade
    0.60
     createState
    0.58
     >=",
    0.53
    InitVars
    0.52
    ühnen
    0.49
    jeka
    0.48
     SEDS
    0.48
    gyz
    0.48
    DrawerToggle
    0.47
    atrième
    0.47
    Act Density 0.534%

    No Known Activations