INDEX
Explanations
The neuron fires strongly on the main narrative “action” verbs that drive plot summaries.
Negative Logits
Token        Logit
CString      -0.08
ุษย          -0.07
)NULL        -0.07
خور          -0.07
melodies     -0.07
ूचन          -0.07
Symfony      -0.06
requestId    -0.06
’am          -0.06
آورد         -0.06
Positive Logits
Token               Logit
Calcul              0.07
Catalog             0.06
}$                  0.06
допомогою           0.06
_sequence           0.06
tbl                 0.06
研究所              0.06
(no visible token)  0.06
delimiter           0.06
CONST               0.06
Activations Density 0.018%
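
For orientation, the density figure above can be read as the share of token positions on which the neuron is active. The sketch below shows one common way such a number is computed. It is an illustration only: the function name `activation_density`, the zero threshold, and the sample data are assumptions, not details taken from this page or from the tool that produced it.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "active" means strictly greater than the threshold (0.0 by default).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 10,000 token positions, 2 of which activate.
sample = [0.0] * 9998 + [1.3, 0.4]
density = activation_density(sample)

# Format as a percentage, the same way the dashboard reports it.
print(f"Activations Density {density:.3%}")
```

A density as low as 0.018% would mean the neuron is active on roughly 2 of every 10,000 token positions under this definition, which is consistent with a sparse, selective feature.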