INDEX
Explanations
instances of the word "prompt" or its variants
New Auto-Interp
Negative Logits
igi
-0.17
iesel
-0.16
wi
-0.15
ERN
-0.14
hou
-0.14
apo
-0.14
bre
-0.14
lect
-0.14
Bold
-0.14
lector
-0.14
POSITIVE LOGITS
æĿIJ
0.16
conditions
0.15
oron
0.15
zilla
0.15
.generated
0.14
stit
0.14
oko
0.14
blem
0.14
lém
0.14
sticks
0.14
Activations Density 0.009%