Explanations
directory
This neuron does not activate on any tokens—it does not detect or respond to any particular pattern.
Negative Logits
Archived    -0.06
Filed       -0.06
amus        -0.06
iful        -0.06
_months     -0.06
unicorn     -0.06
dân         -0.06
TER         -0.06
brides      -0.06
Ë           -0.06
Positive Logits
Environment    0.06
Chain          0.06
481            0.06
Driver         0.06
Factory        0.06
Reader         0.06
结构           0.06
broadcaster    0.06
cwd            0.06
dann           0.06
Activations Density: 0.004%