Explanations
This neuron activates on numeric age references and on nearby “young” context.
Negative Logits
Token          Logit
Netherlands    -0.07
Maker          -0.06
subsection     -0.06
Bean           -0.06
відом          -0.06
MPU            -0.06
Color          -0.06
robot          -0.06
relieved       -0.06
planner        -0.06
Positive Logits
Token          Logit
อนท            0.07
olución        0.06
idUser         0.06
sayısı         0.06
ascript        0.06
CONTACT        0.06
}              0.06
LastName       0.06
ای             0.06
repreh         0.06
Activations Density 0.066%
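
For readers unfamiliar with the density figure, the following is a minimal sketch of one common way such a number is computed: the fraction of token positions on which the neuron's activation is above zero. The page does not state its exact definition, so the function name `activation_density`, the `threshold` parameter, and the sample data are all assumptions made for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of tokens on which the neuron fires.
    The dashboard's own definition may differ.
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical data: the neuron fires on 33 of 50,000 token positions.
sample = [0.0] * 49967 + [1.0] * 33
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density this low means the neuron is silent on the vast majority of tokens, which fits a narrow trigger such as age references.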