INDEX
Explanations
The main thing this neuron does is detect references to children and family (e.g. “children,” “families,” “young boy”).
expressions of emotional responses and experiences related to reading or engaging with stories.
New Auto-Interp
Negative Logits
goalkeeper
-0.07
Assertion
-0.06
adr
-0.06
sour
-0.06
boot
-0.06
Interested
-0.06
beverage
-0.06
Hence
-0.06
Recruitment
-0.06
-0.06
POSITIVE LOGITS
-е
0.06
честь
0.06
артам
0.06
vine
0.06
distract
0.06
reload
0.06
.rx
0.06
please
0.06
_street
0.06
.Circle
0.06
Activations Density 0.168%