INDEX
Explanations
phrases related to causing or inciting action
instances of the word "prompt" and its variations, indicating triggers for actions or responses
New Auto-Interp
Negative Logits
à©
-0.64
Nanto
-0.63
cannabinoid
-0.62
aird
-0.62
Cortex
-0.61
Leaves
-0.61
Sham
-0.61
avia
-0.61
Armen
-0.60
ihar
-0.59
POSITIVE LOGITS
prompt
1.17
prompts
1.06
prompting
0.93
prompted
0.86
iration
0.84
ously
0.81
showc
0.78
rising
0.77
gers
0.75
FontSize
0.74
Activations Density 0.017%