Explanations
The neuron activates whenever the text mentions “server” (including plural “servers”).
Negative Logits
объект          -0.06
ственно         -0.06
(dataset        -0.06
Tabla           -0.06
erman           -0.06
ースト          -0.06
Derby           -0.06
кар             -0.06
errorCallback   -0.06
wash            -0.06
Positive Logits
,@              0.08
jt              0.07
कथ              0.07
ISS             0.07
reviewer        0.07
-checkbox       0.07
.Intent         0.06
%@              0.06
_BOOLEAN        0.06
prolific        0.06
Activations Density 0.013%
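
A density figure like the one above can be read as the share of sampled tokens on which the neuron fires. The sketch below illustrates that reading in Python. It is an assumption, not the page's documented method: the source does not define the metric, and the function name `activation_density` and the `threshold` parameter are hypothetical.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the fraction of sampled tokens with a
    strictly positive activation. The source page does not state this.
    """
    if len(activations) == 0:
        # No tokens sampled, so report zero density rather than dividing by zero.
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)
```

Under this reading, a density of 0.013% would mean the neuron is active on roughly 13 out of every 100,000 sampled tokens.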