INDEX
Explanations
commands or instructions indicating desire or intention
the phrase "if you want."
Negative Logits
Token         Logit
errors        -0.70
enthusi       -0.66
mitter        -0.65
livious       -0.64
bish          -0.61
interstitial  -0.61
gren          -0.60
rawl          -0.60
iky           -0.59
otypes        -0.59
Positive Logits
Token         Logit
reprene       0.73
Continue      0.69
acion         0.66
purposes      0.65
to            0.64
sake          0.63
continuity    0.61
anything      0.61
succeed       0.60
access        0.60
Activations Density 0.049%
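
The page does not say how the density figure is computed. A minimal sketch, assuming it is the percentage of sampled tokens on which the feature's activation is above zero; the function name, the threshold parameter, and the sample data are all hypothetical and chosen only to illustrate a value of this size:

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of sampled tokens on which the
    feature fires at all. The source page does not confirm this definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical sample: the feature fires on 49 of 100,000 tokens.
sample = [1.0] * 49 + [0.0] * 99_951
print(f"{activation_density(sample):.3f}%")
```

Under this reading, a density of 0.049% would mean the feature fires on roughly 1 token in 2,000, which fits a feature tied to a narrow phrase pattern.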