INDEX
Explanations
This neuron detects Q&A metadata terms—words referring to posts and their actions (e.g. “question,” “answer,” “comment,” “wiki,” “post”).
New Auto-Interp
Negative Logits
croll
-0.07
commanding
-0.07
-circle
-0.07
Mission
-0.07
스트
-0.06
geil
-0.06
Lease
-0.06
CircularProgress
-0.06
ASIC
-0.06
獎
-0.06
POSITIVE LOGITS
(pointer
0.06
linea
0.06
аліз
0.06
">×</
0.06
_STATIC
0.05
grav
0.05
eagerly
0.05
pov
0.05
(...)↵
0.05
امة
0.05
Activations Density 0.004%