INDEX
Explanations
phrases beginning with "Did you" prompting for engagement or response
questions and statements directed towards the audience or reader
Negative Logits
Token       Logit
Connector   -0.76
artifacts   -0.76
heter       -0.73
assemb      -0.72
limits      -0.69
presently   -0.68
yond        -0.67
Rel         -0.66
currently   -0.63
Dialogue    -0.62
Positive Logits
Token       Logit
catch       0.86
typo        0.86
originally  0.85
earlier     0.83
stumble     0.82
mistake     0.82
previously  0.81
mention     0.79
miss        0.78
last        0.75
Activations Density: 0.183%