INDEX
Explanations
commands or instructions in a conversation context
dialogue that expresses requests or commands
New Auto-Interp
Negative Logits
Flavoring
-0.79
seemingly
-0.76
etheless
-0.72
umerous
-0.71
ashington
-0.71
astical
-0.71
particularly
-0.71
respective
-0.71
Simply
-0.69
rupulous
-0.69
POSITIVE LOGITS
â̦"
1.26
yours
1.13
..."
1.12
â̦"
1.10
..."
1.08
ya
1.07
your
1.05
!'"
1.05
fuckin
1.04
?'"
1.04
Activations Density 0.507%