INDEX
Explanations
The neuron detects the special control and protocol tokens used to structure the conversation (e.g., the Thought/Action/Observation loop markers and header delimiters).
Negative Logits
Token            Logit
39               -0.07
٢                -0.07
Slider           -0.06
mpar             -0.06
monopoly         -0.06
۱۲               -0.06
idle             -0.06
сен              -0.06
LL               -0.06
DispatchQueue    -0.06
Positive Logits
Token            Logit
Shi              0.07
فت               0.06
vý               0.06
Anniversary      0.06
ัวร              0.06
อนท              0.06
(PyObject        0.06
歲               0.06
виды             0.06
Elvis            0.05
Activations Density: 0.015%