INDEX
Explanations
gender-specific possessive pronouns followed by body parts
references to possession and familial relationships
New Auto-Interp
Negative Logits
ablishment
-0.91
iliate
-0.75
Massacre
-0.73
Meadows
-0.72
ancial
-0.70
Recomm
-0.69
Invasion
-0.66
ãĥ´ãĤ¡
-0.65
ete
-0.65
WAR
-0.64
POSITIVE LOGITS
palms
1.47
knees
1.42
hips
1.42
fingertips
1.41
chin
1.40
fingers
1.38
elbows
1.35
arms
1.35
fists
1.35
gaze
1.34
Activations Density 0.116%