INDEX
Explanations
references to male characters or archetypes
man, man's life, man born
Negative Logits
Token       Logit
uque        -0.55
totic       -0.52
willi       -0.52
futile      -0.50
oros        -0.47
grandiose   -0.46
uture       -0.45
eclip       -0.45
IVE         -0.45
ibat        -0.44
Positive Logits
Token       Logit
man         1.68
Man         1.23
woman       1.20
homem       1.11
hombre      1.08
Man         1.05
Woman       1.04
homme       1.03
uomo        1.01
men         0.99
Activations Density: 0.013%