    Explanations

    The neuron selectively activates on named entities and other specific technical or proper nouns (e.g. organization names, people’s names, specialized terms).

    Negative Logits
    Token          Logit
    " Tik"         -0.07
    " Fernando"    -0.06
    " Heritage"    -0.06
    "FK"           -0.06
    "ela"          -0.06
    " TODAY"       -0.06
    " Paul"        -0.06
    " appealing"   -0.06
    ".one"         -0.06
    " دانشگاه"     -0.06
    Positive Logits
    Token              Logit
    "اختی"             0.07
    "\tJOptionPane"    0.07
    " اختصاص"          0.07
    "Ros"              0.07
    "Trad"             0.07
    "propri"           0.06
    "Happy"            0.06
    "creates"          0.06
    " imperson"        0.06
    (whitespace)       0.06
    Activation Density: 0.180%

    No Known Activations