INDEX
Explanations
The neuron activates on the word “status,” especially when used as a heading or label (e.g. “Status”).
New Auto-Interp
Negative Logits
vine
-0.08
quote
-0.07
twelve
-0.07
ween
-0.07
/main
-0.07
aket
-0.07
210
-0.06
Mine
-0.06
Remote
-0.06
Let
-0.06
POSITIVE LOGITS
status
0.15
Status
0.13
Status
0.12
status
0.11
statuses
0.10
-status
0.09
status
0.09
_status
0.09
estatus
0.09
_Status
0.08
Activations Density 0.018%