INDEX
Explanations
punctuation
This neuron activates on square-bracket tokens that denote indexing operations (e.g. array or list accesses).
New Auto-Interp
Negative Logits
Hell
-0.08
zeigt
-0.07
زارش
-0.07
Specs
-0.06
ulses
-0.06
Ps
-0.06
ática
-0.06
Glass
-0.06
inery
-0.06
Wrapped
-0.06
POSITIVE LOGITS
společně
0.07
/null
0.07
�
0.07
HTTPHeader
0.07
jící
0.07
režim
0.06
lobbyists
0.06
entertain
0.06
baktı
0.06
foy
0.06
Activations Density 0.052%