Explanations
references to the term "man" in various contexts
Negative Logits
Token     Logit
iesen     -0.21
gen       -0.18
go        -0.16
ë´        -0.16
ammers    -0.15
born      -0.15
ted       -0.15
grade     -0.15
й         -0.15
kart      -0.15
Positive Logits
Token      Logit
agements   0.23
hattan     0.23
iac        0.22
ifold      0.19
atee       0.18
agment     0.18
agers      0.18
chester    0.18
UEL        0.18
e          0.18
Activations Density: 0.126%