INDEX
Explanations
Code/technical documentation
The neuron activates on Spanish-language instructions to correct grammar, e.g. words such as corregir ("to correct"), errores gramaticales ("grammatical errors"), and oración ("sentence"), so it appears to detect Spanish grammar-correction directives.
Negative Logits
Token          Logit
discount       -0.07
OA             -0.07
Technician     -0.06
jacket         -0.06
_INCLUDED      -0.06
MAN            -0.06
Ihr            -0.06
enaries        -0.06
vaccinated     -0.06
วม             -0.06
Positive Logits
Token          Logit
(Request       0.07
>Note          0.07
UNIQUE         0.06
.TODO          0.06
ceil           0.06
lege           0.06
_TAC           0.06
poměr          0.06
.Enums         0.06
靈             0.06
Activations Density: 0.066%
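
As a rough illustration of how a density figure like this could be computed, the sketch below assumes that activation density means the percentage of sampled tokens on which the neuron's activation exceeds a threshold. That definition, the function name activation_density, the threshold parameter, and the toy data are all assumptions for illustration; the dashboard itself does not state how the figure was derived.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumption: "density" is the share of sampled tokens on which the
    neuron fires at all. The source dashboard does not define it.
    """
    if len(activations) == 0:
        raise ValueError("need at least one token activation")
    # Count the tokens on which the neuron is considered active.
    fired = sum(1 for a in activations if a > threshold)
    return 100.0 * fired / len(activations)


# Hypothetical toy data: the neuron fires on one token out of four.
toy_activations = [0.0, 0.0, 0.0, 1.5]
print(f"{activation_density(toy_activations):.3f}%")
```

Under this reading, a density of 0.066% would mean the neuron fires on roughly 66 out of every 100,000 sampled tokens, which fits a feature tied to a narrow kind of prompt.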