INDEX
Explanations
observation
This neuron never activates—it effectively looks for nothing (it’s a “dead” neuron).
New Auto-Interp
Negative Logits
.Formatting
-0.06
depot
-0.06
-mean
-0.06
RESET
-0.06
dust
-0.06
IPs
-0.06
,tp
-0.06
_ca
-0.06
Duplicates
-0.06
dusty
-0.06
POSITIVE LOGITS
observation
0.11
observation
0.10
observations
0.08
observations
0.08
Observation
0.08
incur
0.07
athleticism
0.07
巴
0.06
observe
0.06
)(↵
0.06
Activations Density 0.017%