INDEX
Explanations
phrases related to family members, especially referring to the roles of "Mom" and "Dad"
references to 'Mom' and variations of familial relationships
New Auto-Interp
Negative Logits
lihood
-0.67
chnology
-0.65
claimants
-0.62
Ferdinand
-0.61
Commonwealth
-0.61
Tribunal
-0.60
similarities
-0.60
Dull
-0.60
Squadron
-0.58
extrap
-0.58
POSITIVE LOGITS
my
1.47
ma
1.13
mers
1.09
hesis
1.07
entary
1.06
iji
1.01
mer
1.01
MY
0.95
pered
0.92
puter
0.90
Activations Density 0.033%