INDEX
Explanations
This neuron detects occurrences of placeholder entity tokens of the form “NAME_‹number›” in the text.
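As a rough illustration of the token shape this explanation describes, the sketch below matches placeholders with a regular expression. It assumes the pattern is the literal prefix `NAME_` followed by one or more digits. The name `find_placeholders` is hypothetical and not part of any tool referenced here, and the regex only approximates what the neuron responds to.

```python
import re

# Assumption: a placeholder is the literal "NAME_" followed by one or more digits.
PLACEHOLDER = re.compile(r"NAME_\d+")

def find_placeholders(text):
    """Return every placeholder-like substring in the order it appears."""
    return PLACEHOLDER.findall(text)

if __name__ == "__main__":
    print(find_placeholders("NAME_1 said hello to NAME_2 and NAME_13."))
```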
Negative Logits
ительное     -0.07
otechnology  -0.07
$            -0.07
trainer      -0.06
ibu          -0.06
паль         -0.06
mafia        -0.06
áže          -0.06
isi          -0.06
commits      -0.06
Positive Logits
arbe      0.07
учеб      0.06
نب        0.06
[].       0.06
indrical  0.06
Sov       0.06
ernals    0.06
předch    0.06
�         0.05
�         0.05
Activations Density 0.030%