INDEX
Explanations
phrases beginning with the words "Can you"
questions and requests for assistance involving the word "you."
New Auto-Interp
Negative Logits
ylum
-0.72
hyde
-0.67
Flags
-0.64
teen
-0.63
history
-0.62
Lenin
-0.61
mails
-0.60
assic
-0.60
artifacts
-0.58
Respons
-0.58
POSITIVE LOGITS
afford
0.96
tell
0.87
accommodate
0.87
reconcile
0.84
safely
0.83
convince
0.81
conceive
0.81
be
0.81
compare
0.80
quantify
0.79
Activations Density 0.040%