INDEX
Explanations
This neuron responds to mentions of things being “missed” (i.e. failures or error rates in detection tasks).
New Auto-Interp
Negative Logits
redirect
-0.07
dev
-0.06
.getMax
-0.06
стер
-0.06
-it
-0.06
.combine
-0.06
sah
-0.06
composite
-0.06
repeat
-0.06
.ab
-0.06
POSITIVE LOGITS
Failed
0.06
dao
0.06
姓名
0.06
_slots
0.06
мене
0.06
为什么
0.06
شود
0.06
(--
0.06
文献
0.06
huge
0.06
Activations Density 0.019%