INDEX
    Explanations

    references to volunteering and community involvement

    New Auto-Interp
    Negative Logits
     Homo
    -0.16
    егоÑĢ
    -0.15
    oba
    -0.15
    Porno
    -0.15
    _RESP
    -0.15
     masculinity
    -0.15
    prung
    -0.14
     homo
    -0.14
     semen
    -0.13
    ikon
    -0.13
    POSITIVE LOGITS
     sor
    0.25
     Sor
    0.25
     alum
    0.24
     Girl
    0.23
     Junior
    0.21
     Juliet
    0.21
     girls
    0.21
     Jun
    0.20
     girl
    0.19
    Girl
    0.19
    Act Density 0.013%

    No Known Activations