INDEX
Explanations
phrases related to technical instructions or guides
sentences that start with "This is" or similar structures
Negative Logits
igators   -0.64
angs      -0.62
ievers    -0.61
aea       -0.61
igator    -0.60
luaj      -0.60
waukee    -0.60
selves    -0.60
elve      -0.59
iating    -0.59
Positive Logits
my          0.90
NOT         0.87
an          0.83
another     0.83
definitely  0.81
a           0.79
probably    0.76
what        0.76
excerpt     0.75
why         0.74
Activations Density: 0.079%