INDEX
Explanations
possessive pronouns and references to ownership or personal relationships
Negative Logits
eele          -0.68
Izan          -0.64
ablishment    -0.63
arthed        -0.63
Unch          -0.60
Illum         -0.60
vine          -0.59
ocument       -0.58
Frazier       -0.57
river         -0.57
Positive Logits
own           1.55
OWN           1.11
self          1.11
selves        1.03
Own           0.98
elf           0.93
rightful      0.93
respective    0.87
fullest       0.87
allotted      0.85
Activations Density: 0.115%