INDEX
Explanations
Game command sequences
The neuron specifically detects the model’s internal protocol/control tokens (e.g. metadata markers like <|eot_id|>, header delimiters, and other non‐content structural tags).
New Auto-Interp
Negative Logits
arranging
-0.07
Greatest
-0.06
igor
-0.06
importantes
-0.06
sending
-0.06
########################################################
-0.06
accelerated
-0.06
mitter
-0.06
drinking
-0.06
-best
-0.06
POSITIVE LOGITS
(edit
0.08
moz
0.07
řít
0.07
.orig
0.07
ليات
0.06
moss
0.06
azione
0.06
ACKET
0.06
느�
0.06
České
0.06
Activations Density 0.028%