INDEX
Explanations
This neuron responds to phrasing that indicates someone “has been tasked with” or assigned a responsibility.
New Auto-Interp
Negative Logits
mentoring
-0.07
incl
-0.06
,因
-0.06
seaborn
-0.06
LEFT
-0.06
DAY
-0.06
ENO
-0.06
pz
-0.06
Pamela
-0.06
_WALL
-0.06
POSITIVE LOGITS
�
0.07
г
0.07
vý
0.06
trag
0.06
rası
0.06
requ
0.06
kám
0.06
ậm
0.06
ční
0.06
üğ
0.06
Activations Density 0.031%