INDEX
Explanations
phrases related to criticism or disagreement
instances of commas in the text
New Auto-Interp
Negative Logits
aimon
-0.71
pps
-0.71
ãĥīãĥ©
-0.70
ãĤ£
-0.70
mage
-0.69
ãĤµãĥ¼ãĥĨãĤ£ãĥ¯ãĥ³
-0.69
iang
-0.68
iple
-0.68
ISE
-0.66
ource
-0.64
POSITIVE LOGITS
saying
1.91
stating
1.72
noting
1.71
insisting
1.64
claiming
1.58
arguing
1.57
citing
1.54
asserting
1.49
accusing
1.47
pointing
1.47
Activations Density 0.254%