Explanations
This neuron activates on occurrences of “2D” used as a dimensionality specifier.
Negative Logits
Token        Logit
Somebody     -0.06
states       -0.06
اذ           -0.06
ax           -0.06
Igor         -0.06
burg         -0.06
Jake         -0.06
Ral          -0.06
republice    -0.06
Submitted    -0.06
Positive Logits
Token        Logit
روش          0.07
ây           0.06
우리         0.06
chiff        0.06
fuscated     0.06
хорош        0.06
여러분       0.06
財           0.06
逸           0.06
creative     0.06
Activations Density: 0.004%