INDEX
Explanations
comma or quote
This neuron detects high‐level “meta” instructions or policy directives telling the assistant how to behave (e.g. “only send the completion based on the system instructions,” “don’t repeat,” etc.).
New Auto-Interp
Negative Logits
.Bit
-0.07
.plugin
-0.07
виконав
-0.07
chips
-0.07
StackNavigator
-0.06
Falcon
-0.06
선수
-0.06
_column
-0.06
+j
-0.06
:uint
-0.06
POSITIVE LOGITS
surv
0.06
Saskatchewan
0.06
Verify
0.06
.Find
0.06
ogi
0.06
'action
0.06
》
0.06
versatility
0.06
fade
0.06
gatsby
0.06
Activations Density 0.026%