INDEX
    Explanations

    references to boys and gender in various contexts

    Negative Logits
    " CreateTagHelper"    -0.62
    "AutoScaleMode"       -0.51
    "Personensuche"       -0.50
    "Tembelea"            -0.49
    "\{\\"                -0.48
    " linkovi"            -0.48
    " الرياضيه"           -0.48
    " تضيفلها"            -0.47
    "inSlope"             -0.47
    "protoimpl"           -0.45
    Positive Logits
    " scout"     0.77
    " scouts"    0.71
    " Scouts"    0.67
    " Scout"     0.65
    "FRIEND"     0.61
    "scout"      0.58
    "cout"       0.53
    "hood"       0.53
    "Scout"      0.52
    "friend"     0.50
    Activation Density: 0.190%

    No Known Activations