INDEX
Explanations
This neuron selectively activates on high-frequency short function words—especially prepositions and articles like “of,” “in,” and “the.”
New Auto-Interp
Negative Logits
Esper
-0.06
Temple
-0.06
pasa
-0.06
Emoji
-0.06
Painter
-0.06
Watching
-0.06
veis
-0.06
fleet
-0.06
Parent
-0.06
Century
-0.06
POSITIVE LOGITS
کسب
0.07
گردید
0.07
magg
0.07
ระบ
0.07
’une
0.07
.ShowDialog
0.06
cerv
0.06
'une
0.06
#+#
0.06
(annotation
0.06
Activations Density 0.102%