INDEX
Explanations
phrases indicating possession or affiliation with someone or something
references to personal ownership or self-identity
Negative Logits
hesda      -0.79
antle      -0.74
xual       -0.74
romeda     -0.74
anwhile    -0.73
olic       -0.72
MER        -0.71
obar       -0.71
oms        -0.70
ILCS       -0.69
Positive Logits
personal    0.84
accord      0.83
backyard    0.78
version     0.76
affairs     0.75
hars        0.74
selves      0.74
creations   0.73
admission   0.72
downfall    0.70
Activations Density: 0.037%