INDEX
    Explanations

    This neuron activates on references to “2D” (i.e. the dimensionality specifier “2D”).

    New Auto-Interp
    Negative Logits
     Somebody
    -0.06
    states
    -0.06
     اذ
    -0.06
    	ax
    -0.06
     Igor
    -0.06
     burg
    -0.06
    Jake
    -0.06
    Ral
    -0.06
     republice
    -0.06
    Submitted
    -0.06
    POSITIVE LOGITS
     روش
    0.07
    ây
    0.06
     우리
    0.06
     chiff
    0.06
    fuscated
    0.06
     хорош
    0.06
     여러분
    0.06
    0.06
    0.06
     creative
    0.06
    Act Density 0.004%

    No Known Activations