INDEX
Explanations
the word "male"
New Auto-Interp
Negative Logits
Monfieur
-1.30
enumi
-1.14
ainfi
-1.12
auffi
-1.10
AccessorTable
-1.04
aveug
-1.00
avoient
-0.98
tombé
-0.98
Efq
-0.98
quelcon
-0.96
POSITIVE LOGITS
re
0.71
par
0.66
0.65
non
0.64
large
0.63
multi
0.61
“
0.59
his
0.57
«
0.56
local
0.56
Activations Density 1.815%