INDEX
Explanations
pronouns referring to family relationships and newborns
New Auto-Interp
Negative Logits
Rounds
-0.69
Kitty
-0.66
SIGN
-0.65
Salon
-0.65
PN
-0.65
GMT
-0.65
Pharaoh
-0.64
Ov
-0.64
Rothschild
-0.62
Mobil
-0.62
POSITIVE LOGITS
selves
1.32
lightly
1.13
pecially
1.09
atisf
1.09
aying
1.07
ELF
1.04
ources
1.04
ustainable
1.03
omew
1.03
uddenly
1.03
Activations Density 9.212%