Explanations
Mentions of mature or explicit content in narratives.
The neuron fires on the “Content Warning” header—especially the phrase “work of fiction” in the warning block.
Negative Logits
804          -0.06
-duration    -0.06
하신         -0.06
-free        -0.06
burn         -0.06
-k           -0.06
believers    -0.06
사는         -0.06
moderne      -0.06
肯           -0.06
Positive Logits
Xi                        0.07
Alibaba                   0.07
Postal                    0.07
.getBoundingClientRect    0.07
:uint                     0.06
하다                      0.06
Gi�                       0.06
ypress                    0.06
Workspace                 0.06
ší                        0.06
Activations Density 0.001%