Explanations
Scientific publications
The neuron activates on author initials and names in citation/reference lists.
Negative Logits
Token        Logit
Nx           -0.08
.setUp       -0.07
denim        -0.07
ável         -0.07
emia         -0.06
.Register    -0.06
якому        -0.06
자가         -0.06
locked       -0.06
的に         -0.06
Positive Logits
Token        Logit
lieu         0.07
offers       0.06
Club         0.06
incap        0.06
class        0.06
yönetim      0.06
Goldberg     0.06
lov          0.06
hasil        0.06
class        0.06
Activations Density: 0.011%