INDEX
Explanations
The neuron selectively activates on named entities and other specific technical or proper nouns (e.g. organization names, people’s names, specialized terms).
New Auto-Interp
Negative Logits
Tik
-0.07
Fernando
-0.06
Heritage
-0.06
FK
-0.06
ela
-0.06
TODAY
-0.06
Paul
-0.06
appealing
-0.06
.one
-0.06
دانشگاه
-0.06
POSITIVE LOGITS
اختی
0.07
JOptionPane
0.07
اختصاص
0.07
Ros
0.07
Trad
0.07
propri
0.06
Happy
0.06
creates
0.06
imperson
0.06
0.06
Activations Density 0.180%