INDEX
Explanations
contrasting viewpoints
The neuron selectively responds to past‐tense action words that drive the narrative (e.g. “laughed,” “begging,” “captives”).
New Auto-Interp
Negative Logits
ourage
-0.07
isode
-0.07
dispositivo
-0.07
(inertia
-0.06
photographer
-0.06
olume
-0.06
-mediated
-0.06
_to
-0.06
adult
-0.06
ivan
-0.06
POSITIVE LOGITS
skb
0.07
xlink
0.07
ıldı
0.07
вт
0.07
NDP
0.06
₹
0.06
воздейств
0.06
prank
0.06
cntl
0.06
(+
0.06
Activations Density 0.019%