INDEX
Explanations
personal pronouns followed by verbs
instances of the pronoun "I" and expressions of personal experience or actions
New Auto-Interp
Negative Logits
impunity
-0.86
endif
-0.72
whichever
-0.65
indistinguishable
-0.63
margins
-0.63
inaction
-0.63
coercive
-0.61
srfAttach
-0.61
limits
-0.60
escal
-0.60
POSITIVE LOGITS
've
1.31
awoke
1.24
'm
1.22
woke
1.16
stanbul
1.10
stumbled
1.06
recently
1.01
adore
0.99
arrived
0.98
LOVE
0.97
Activations Density 0.215%