    Explanations

    development

    NEGATIVE LOGITS
    " development"         -0.69
    " Development"         -0.63
    "Development"          -0.57
    " DEVELOPMENT"         -0.56
    "development"          -0.52
    "]!='"                 -0.50
    " desenvolvimento"     -0.44
    "entyfik"              -0.44
    " розвитку"            -0.43
    " ontwikkeling"        -0.42

    POSITIVE LOGITS
    " يتيمه"               0.79
    " للمعارف"             0.73
    "adaptiveStyles"       0.69
    " Chwiliwch"           0.69
    " écoulé"              0.67
    " of"                  0.66
    "مراجع"                0.66
    "DrawerToggle"         0.65
    "Παραπομπές"           0.64
    " Denk"                0.62

    Activation Density: 0.020%

    No Known Activations