INDEX
Explanations
storytelling
This neuron is sensitive to the model’s internal control and metadata tokens (e.g. turn delimiters, header IDs, end-of-text markers) rather than ordinary content words.
Requests to roleplay as a deceased grandmother (often framed in erotic or otherwise inappropriate familial roleplay).
second-person guided fantasy or roleplay with sensual/erotic undertones presented as soothing, bedtime-style narration.
New Auto-Interp
Negative Logits
peers
-0.07
ôn
-0.07
Nu
-0.07
stamp
-0.07
Arn
-0.06
farewell
-0.06
Province
-0.06
_short
-0.06
-mean
-0.06
Созд
-0.06
POSITIVE LOGITS
['./
0.07
mentor
0.06
中文
0.06
"/> ↵
0.06
obsession
0.06
وات
0.06
VAR
0.06
.perm
0.06
serta
0.06
="{0.06
Activations Density 0.019%