INDEX
Explanations
The neuron selectively activates on the multi‐token word “fallopian” (as in “fallopian tubes”), i.e. references to the fallopian tubes.
New Auto-Interp
Negative Logits
riting
-0.07
腹
-0.07
Multiplicity
-0.06
otechnology
-0.06
zar
-0.06
rough
-0.06
partie
-0.06
字幕
-0.06
سری
-0.06
etal
-0.06
POSITIVE LOGITS
pan
0.08
руб
0.07
激
0.07
صنع
0.07
/lab
0.06
nomin
0.06
Galaxy
0.06
RegExp
0.06
切
0.06
buena
0.06
Activations Density 0.000%