INDEX
Explanations
mentions of the word "Man" or related variations
terminology and phrases associated with masculinity and male identity
New Auto-Interp
Negative Logits
DragonMagazine
-0.87
Transcript
-0.79
Birth
-0.77
catentry
-0.77
RP
-0.75
RIP
-0.73
DEV
-0.72
Balt
-0.72
Mer
-0.71
externalToEVAOnly
-0.70
POSITIVE LOGITS
anchester
0.78
apes
0.70
pronoun
0.69
ape
0.68
lda
0.68
Mara
0.66
Viet
0.66
ufact
0.66
Samoa
0.65
atown
0.65
Activations Density 0.291%