INDEX
Explanations
instructions on how to perform specific tasks or procedures
second-person pronouns and directives addressing the reader
New Auto-Interp
Negative Logits
Georg
-0.59
millenn
-0.54
REDACTED
-0.54
ãĤ´ãĥ³
-0.54
earthqu
-0.53
adium
-0.53
amaz
-0.52
hearted
-0.52
Atlantis
-0.52
Joined
-0.52
POSITIVE LOGITS
'll
1.27
shouldn
1.20
MUST
1.16
're
1.15
need
1.15
should
1.11
SHOULD
1.10
want
1.08
NEED
1.08
might
1.06
Activations Density 0.161%