INDEX
Explanations
The neuron activates on words referring to crafting activities and craft‐related terms (e.g., “craft,” “crafting,” “DIY”).
New Auto-Interp
Negative Logits
daf
-0.06
цеп
-0.06
abbo
-0.06
rieve
-0.06
timer
-0.06
дет
-0.06
Properties
-0.06
vier
-0.05
ayar
-0.05
Levine
-0.05
POSITIVE LOGITS
FB
0.07
Welcome
0.07
Desert
0.07
Rec
0.07
Sınıf
0.07
_segment
0.07
.Wrap
0.06
pletion
0.06
Crafts
0.06
slime
0.06
Activations Density 0.053%