INDEX
Explanations
patterns related to male individuals
references to male individuals or informal terms for men
Negative Logits
èª            -0.85
theless       -0.79
tnc           -0.73
illus         -0.70
Parables      -0.70
Destination   -0.70
Decre         -0.68
ãĤ¯           -0.67
lyak          -0.64
Import        -0.63
Positive Logits
abase         0.95
who           0.89
holes         0.85
opausal       0.84
else          0.79
heads         0.79
named         0.73
jeans         0.73
WithNo        0.72
hole          0.71
Activations Density 0.072%
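
The density figure above can be read as the share of sampled token positions on which the feature fires. The following is a minimal sketch under that assumption; the page does not state how the value is computed. The function name `activation_density`, the zero threshold, and the sample counts (18 active positions out of 25,000) are hypothetical and chosen only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of sampled tokens on which the
    feature is active. The source page does not define it explicitly.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 18 active positions out of 25,000 (invented counts).
sample = [1.3] * 18 + [0.0] * (25_000 - 18)
density = activation_density(sample)
print(f"{density:.3%}")
```

A density this low would mean the feature is sparse: it is inactive on the large majority of tokens and fires only on a narrow set of contexts.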