INDEX
Explanations
This neuron detects unusually long or repeated-character tokens and document-boundary/extended-input tokens—i.e., long continuous strings or markers that indicate large input blocks.
special tokens and markers that indicate conversational structure, particularly turn boundaries and role transitions in chat-formatted dialogue.
chat-conversation boundary markers and special formatting tokens (like start/end of turns and headers).
New Auto-Interp
Negative Logits
�
-0.08
garments
-0.07
掛
-0.07
Carson
-0.07
empty
-0.07
АН
-0.07
standing
-0.06
敌
-0.06
accom
-0.06
тех
-0.06
POSITIVE LOGITS
där
0.07
.sym
0.07
CGRectMake
0.06
墙
0.06
أيضا
0.06
>();
0.06
categoryId
0.06
pageTitle
0.06
товарів
0.06
.setMessage
0.06
Activations Density 15.233%