INDEX
    Explanations

    male/female sex and gender

    New Auto-Interp
    Negative Logits
    reminder
    0.49
    moved
    0.45
    belum
    0.43
    waved
    0.43
     கடல்
    0.42
    icano
    0.42
    istatic
    0.42
    brains
    0.41
    weathermap
    0.40
    otemporal
    0.40
    POSITIVE LOGITS
     kız
    0.52
     gender
    0.50
     Gender
    0.50
     sex
    0.49
     girls
    0.48
     Girls
    0.46
    女孩
    0.46
     parents
    0.45
     والدین
    0.44
     ønsk
    0.44
    Act Density 0.024%

    No Known Activations