INDEX
Explanations
instances where the phrase "no" is immediately followed by the number 9
statements of negation or lack of certainty
Negative Logits
rex            -0.71
ale            -0.70
ammad          -0.70
leaning        -0.66
south          -0.66
interstitial   -0.61
often          -0.61
everyone       -0.60
tenance        -0.59
rhet           -0.59
Positive Logits
clue           1.28
idea           1.07
shortage       1.04
illusions      1.04
doubt          1.04
intention      1.04
excuse         1.02
qual           1.01
chance         0.98
recourse       0.97
Activations Density 0.047%
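
The figures above can be read as plain data. The following is a minimal sketch, not the dashboard's own code: it copies the listed positive logits into a dictionary, ranks them, and converts the density figure into a rough "one activation per N tokens" rate. The names `positive_logits`, `top_tokens`, and `tokens_per_activation` are hypothetical, and two readings are assumptions: that the positive-logit tokens are ones the feature promotes after "no", and that the density is the share of sampled tokens on which the feature is active.

```python
# Values copied from the listing above. The dictionary and function names
# are hypothetical, introduced only for this illustration.
positive_logits = {
    "clue": 1.28, "idea": 1.07, "shortage": 1.04, "illusions": 1.04,
    "doubt": 1.04, "intention": 1.04, "excuse": 1.02, "qual": 1.01,
    "chance": 0.98, "recourse": 0.97,
}


def top_tokens(logits, k=3):
    """Return the k (token, value) pairs with the largest logit values."""
    return sorted(logits.items(), key=lambda kv: kv[1], reverse=True)[:k]


# Assumption: density is the percentage of sampled tokens with a nonzero
# activation, so 0.047% corresponds to roughly one token in 2128.
density_percent = 0.047
tokens_per_activation = 100 / density_percent

for token, value in top_tokens(positive_logits):
    # Assumption: these tokens complete phrases such as "no clue".
    print(f"no {token}  (+{value:.2f})")

print(f"about 1 activation per {tokens_per_activation:.0f} tokens")
```

Read this way, the highest-ranked completions ("no clue", "no idea") fit the second explanation, statements of negation or lack of certainty, better than the first.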