INDEX
    Explanations

    Terms related to stereotypes and stereotyping, including their implications

    Negative Logits
    " يتيمه"  -0.93
    "SharedDtor"  -0.79
    "ollectionView"  -0.78
    "Composable"  -0.77
    "Cubit"  -0.76
    "WriteAttribute"  -0.75
    "windigkeit"  -0.73
    "cách"  -0.73
    "EnableWeb"  -0.73
    "первых"  -0.72
    Positive Logits
    " stere"  1.64
    " Stere"  1.63
    "Stere"  1.45
    " stereo"  1.22
    " Stereo"  1.17
    "stere"  1.09
    " stereotype"  1.09
    " stereotypes"  1.07
    "Stereo"  0.96
    " stereotyp"  0.85
    Activation Density: 0.003%

    No Known Activations