INDEX
Explanations
references to mothers and maternal relationships
New Auto-Interp
Negative Logits
hline
-0.66
ski
-0.65
GGLE
-0.65
冀
-0.62
كومونز
-0.62
ardust
-0.62
Darius
-0.61
Danilo
-0.61
înc
-0.61
#[
-0.60
POSITIVE LOGITS
mothers
1.50
Mothers
1.46
Mothers
1.46
MOTHER
1.43
MOTHER
1.42
Mother
1.39
mother
1.34
mother
1.28
Mother
1.27
mothers
1.19
Activations Density 0.057%