INDEX
Explanations
proper nouns, specifically surnames
names of people
New Auto-Interp
Negative Logits
ò
-0.85
cumbers
-0.84
conflic
-0.81
eleph
-0.79
exting
-0.78
ñ
-0.74
Orderable
-0.74
aditional
-0.74
ô
-0.74
Takeru
-0.72
POSITIVE LOGITS
man
1.94
mans
1.70
MAN
1.58
mann
1.49
men
1.36
eman
1.19
fman
1.19
mania
1.18
Man
1.15
woman
1.11
Activations Density 0.083%