Explanations
phrases related to actions done without permission or consent
instances of the word "without" and phrases implying lack or absence
Negative Logits
late                -0.80
raq                 -0.78
lated               -0.68
soon                -0.68
lyak                -0.68
=-=-=-=-=-=-=-=-    -0.68
berman              -0.67
onen                -0.66
Ranked              -0.66
aez                 -0.65
Positive Logits
bothering           1.23
realizing           1.19
noticing            1.17
hesitation          1.17
mentioning          1.07
knowing             1.06
blinking            1.04
specifying          1.03
interruption        1.03
exception           1.00
Activations Density 0.049%