INDEX
Explanations
references to the concept of "man," often referring to males or masculinity
occurrences of the word "man."
Negative Logits
Grounds     -0.71
Ames        -0.67
ADC         -0.63
AVG         -0.63
Oath        -0.61
DERR        -0.60
Rivals      -0.59
outgoing    -0.59
Dame        -0.59
viz         -0.58
Positive Logits
gling       1.33
hattan      1.32
ifest       1.30
nered       1.25
uscript     1.19
oeuv        1.16
agers       1.16
handled     1.14
made        1.13
ila         1.12
Activations Density 0.059%
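
The density figure above is commonly read as the share of token positions on which the feature is active. The sketch below shows that arithmetic under stated assumptions: the function name `activation_density`, the zero threshold, and the sample data are all hypothetical and are not taken from this page or from any specific tool.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of positions whose activation exceeds `threshold`.

    Assumption: "density" means the fraction of token positions where the
    feature fires at all; the page itself does not define the term.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    # Count positions where the feature is considered active.
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


if __name__ == "__main__":
    # Hypothetical data: one active position out of 10,000 tokens.
    sample = [0.0] * 9999 + [3.2]
    print(f"{activation_density(sample):.3f}%")
```

Under this reading, a density of 0.059% would correspond to roughly 59 active positions per 100,000 tokens.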