INDEX
Explanations
pronouns and possessive nouns
references to possession or ownership
New Auto-Interp
Negative Logits
eers
-0.82
netflix
-0.82
river
-0.77
Frazier
-0.76
hari
-0.72
ahead
-0.72
haus
-0.72
Uriel
-0.71
Tree
-0.70
ingle
-0.69
POSITIVE LOGITS
own
1.68
newfound
1.16
inability
1.15
predicament
1.14
intentions
1.13
actions
1.07
plight
1.06
impending
1.06
dealings
1.05
predecessors
1.03
Activations Density 0.376%