INDEX
    Explanations

    terms related to changes and enhancements in features or conditions

    New Auto-Interp
    Negative Logits
    RegressionTest
    -0.66
     nonUne
    -0.63
    WebElementEntity
    -0.57
     يتيمه
    -0.57
     kasarigan
    -0.56
    хьтан
    -0.55
     ddelweddau
    -0.53
     الرياضيه
    -0.51
    delwed
    -0.51
    مصادر
    -0.50
    POSITIVE LOGITS
     ideales
    0.39
     brancas
    0.39
     ideal
    0.37
     seragam
    0.36
    entist
    0.36
     femininos
    0.35
     prácti
    0.34
    rouw
    0.34
     uniform
    0.34
    ideal
    0.34
    Act Density 0.785%

    No Known Activations