INDEX
Explanations
Affirmation
The neuron reliably activates on grammar-check commentary: tokens that occur when evaluating or correcting sentence grammar (e.g. “sentence,” “grammatically,” “correct,” “improve,” “revision”).
Negative Logits
(digits    -0.07
_keeper    -0.06
经过    -0.06
ords    -0.06
oppos    -0.06
ITTE    -0.06
θούν    -0.06
Js    -0.06
spent    -0.06
위원    -0.06
Positive Logits
testName    0.07
�    0.06
Mourinho    0.06
대로    0.06
clearInterval    0.06
илось    0.06
_reply    0.06
>\<^    0.06
automáticamente    0.06
номер    0.06
Activations Density 0.021%