INDEX
Explanations
mentions of the word "man" at varying strengths of activation
instances of the word "man" in various forms
New Auto-Interp
Negative Logits
ombo
-0.67
PsyNetMessage
-0.65
fuel
-0.63
ritical
-0.62
ignty
-0.62
heet
-0.62
fibre
-0.62
rolet
-0.61
avorite
-0.60
ifully
-0.59
POSITIVE LOGITS
ifest
1.40
iac
1.18
ning
1.09
iasis
1.08
hattan
1.06
ufact
1.04
uscript
0.99
agements
0.98
aging
0.97
ned
0.96
Activations Density 0.069%