INDEX
Explanations
The neuron fires on mentions of “server” (especially in the context of web or HTTP servers).
New Auto-Interp
Negative Logits
edula
-0.06
preca
-0.06
녀
-0.06
кафед
-0.06
놀
-0.06
ẵn
-0.06
Greatest
-0.06
Pear
-0.06
าต
-0.06
안
-0.06
POSITIVE LOGITS
alterations
0.07
sucked
0.07
サービス
0.07
bek
0.07
khá
0.06
Filed
0.06
-posts
0.06
_filters
0.06
이런
0.06
ото
0.06
Activations Density 0.032%