INDEX
Explanations
possessive pronouns
possessive pronouns, particularly "his."
New Auto-Interp
Negative Logits
etheless
-0.64
both
-0.63
IVERS
-0.59
alike
-0.59
unden
-0.55
@#&
-0.49
amily
-0.49
personalities
-0.48
ANGE
-0.47
Cry
-0.47
POSITIVE LOGITS
/
1.60
or
1.45
/#
1.20
/
1.19
/,
1.19
/.
1.17
panic
1.16
/"
1.08
/)
1.05
/(
1.04
Activations Density 0.216%