INDEX
Explanations
The neuron detects the token “Yes” when it’s used to start an answer.
New Auto-Interp
Negative Logits
(depend
-0.06
.•
-0.06
_books
-0.06
continental
-0.06
("`-0.06
([-
-0.06
IDX
-0.06
nocení
-0.06
Rh
-0.06
tokenId
-0.06
POSITIVE LOGITS
Чтобы
0.07
Projekt
0.07
inking
0.06
Until
0.06
صنعت
0.06
toHave
0.06
-budget
0.06
salary
0.06
ับความ
0.06
oriasis
0.06
Activations Density 0.000%