    Negative Logits
    Token         Logit
    " girls"      -1.31
    " female"     -1.25
    " females"    -1.16
    " Female"     -1.12
    "female"      -1.09
    "Female"      -1.02
    "girls"       -1.01
    " girl"       -0.98
    " Females"    -0.98
    " Girls"      -0.96
    Positive Logits
    Token                Logit
    "AddTagHelper"       0.78
    "awtextra"           0.66
    "DebuggerNonUser"    0.62
    "TemporalType"       0.62
    "nocześnie"          0.58
    "WriteBarrier"       0.58
    " and"               0.57
    "TagHelpers"         0.57
    " ImportError"       0.56
    "новништво"          0.54
    Activation Density: 0.871%

    No Known Activations