INDEX
Explanations
scientific papers
This neuron fires on author surnames (proper names) in the paper metadata.
New Auto-Interp
Negative Logits
_alert
-0.06
Bert
-0.06
arrival
-0.06
figures
-0.06
.strings
-0.06
fuse
-0.06
ویزی
-0.06
$sub
-0.06
isLoggedIn
-0.06
�
-0.05
POSITIVE LOGITS
ende
0.07
ija
0.07
NONE
0.07
.defineProperty
0.07
gerade
0.07
smirk
0.07
scattered
0.07
gradually
0.06
익
0.06
exc
0.06
Activations Density 0.020%