INDEX
Explanations
requests and demands
The neuron activates on non-English word fragments, particularly Slavic/Slovene tokens with diacritics.
Negative Logits
Lux            -0.06
Ner            -0.06
Applied        -0.06
grinder        -0.06
_rd            -0.06
election       -0.06
considering    -0.06
applied        -0.06
possibility    -0.06
ita            -0.06
Positive Logits
ději           0.07
/css           0.07
acomment       0.07
navbar         0.06
taient         0.06
dk             0.06
¡              0.06
.sap           0.06
_kategori      0.06
(iv            0.06
Activations Density 0.177%