INDEX
Explanations
component
This neuron never activates—it doesn’t detect any pattern or feature.
New Auto-Interp
Negative Logits
fre
-0.06
uterus
-0.06
iod
-0.06
.collection
-0.06
월
-0.06
(model
-0.06
_SLEEP
-0.06
))).
-0.06
());↵↵
-0.05
Glock
-0.05
POSITIVE LOGITS
Antib
0.07
Applicants
0.07
.CenterScreen
0.07
licants
0.07
learned
0.06
_support
0.06
救
0.06
خان
0.06
Savings
0.06
################################
0.06
Activations Density 0.004%