INDEX
Explanations
This neuron specifically detects the “assistant” speaker tag at the start of an assistant response.
technical details related to processes and instructions.
New Auto-Interp
Negative Logits
buttons
-0.07
expire
-0.06
Numer
-0.06
Unc
-0.06
Uniform
-0.06
나타
-0.06
Succ
-0.06
Administrator
-0.06
Customers
-0.06
CHAIN
-0.06
POSITIVE LOGITS
riteln
0.06
layıcı
0.06
(system
0.06
nem
0.06
oit
0.06
snel
0.06
(%
0.06
PELL
0.06
staple
0.06
fontName
0.06
Activations Density 0.058%