    Explanations

    Scientific publications

    The neuron activates on author initials and names in citation/reference lists.

    Negative Logits
    Token          Logit
    "Nx"           -0.08
    ".setUp"       -0.07
    " denim"       -0.07
    "ável"         -0.07
    "emia"         -0.06
    ".Register"    -0.06
    " якому"       -0.06
    "자가"         -0.06
    "locked"       -0.06
    "的に"         -0.06
    Positive Logits
    Token          Logit
    " lieu"        0.07
    " offers"      0.06
    " Club"        0.06
    " incap"       0.06
    "class"        0.06
    " yönetim"     0.06
    " Goldberg"    0.06
    " lov"         0.06
    "hasil"        0.06
    "\tclass"      0.06
    Activation density: 0.011%

    No Known Activations