Explanations
The neuron is triggered by second-person self-referential language, especially the possessive pronoun “your.”
Elements related to authority figures and their interactions with subordinates.
Negative Logits
shipment    -0.08
sne         -0.07
스티         -0.07
rooms       -0.07
sciences    -0.07
Tories      -0.07
house       -0.07
otal        -0.06
lapping     -0.06
SHORT       -0.06
Positive Logits
pageIndex   0.07
orderby     0.06
context     0.06
getWindow   0.06
Brun        0.06
','$        0.06
rb          0.06
'(          0.06
еного       0.05
CheckBox    0.05
Activations Density: 0.000%