INDEX
    Explanations

    criteria for non-discrimination and equal opportunities based on traits like race, gender, sexual orientation, and disability

    New Auto-Interp
    Negative Logits
    <bos>
    -3.40
    -1.06
    
    
    -0.93
    /**
    -0.90
    <?
    -0.87
    /*
    -0.83
    /*++
    -0.77
    <?
    
    -0.77
     find
    -0.71
    <!--
    
    -0.70
    POSITIVE LOGITS
     Juf
    1.70
     Minang
    1.67
     stockholm
    1.61
     aen
    1.60
     bandung
    1.59
     dises
    1.58
     riviera
    1.53
     thut
    1.53
     Augu
    1.53
     maer
    1.53
    Act Density 0.062%

    No Known Activations