Explanations
family members
terms related to familial relationships, particularly focusing on grandparents and their interactions or roles within families.
The neuron activates on mentions of “grandparent” (including “grandparents,” “grandmother,” etc.).
Negative Logits
Acts      -0.07
menu      -0.07
lig       -0.07
Menu      -0.07
firms     -0.06
Pad       -0.06
Ent       -0.06
Soul      -0.06
fuse      -0.06
отдель    -0.06
Positive Logits
grandma         0.12
grandmother     0.11
grandfather     0.10
grandparents    0.09
Grandma         0.09
GM              0.07
ые              0.07
وغير            0.07
átka            0.07
#=>             0.06
Activations Density: 0.006%