INDEX
Explanations
The neuron detects terms that signal open or awaiting‐action items (e.g. pending or outstanding entries).
New Auto-Interp
Negative Logits
IFn
-0.08
พวก
-0.07
рет
-0.07
148
-0.06
orbits
-0.06
ساخته
-0.06
Ning
-0.06
reten
-0.06
mayacak
-0.06
勢
-0.06
POSITIVE LOGITS
afflict
0.06
(**
0.06
."↵↵↵↵
0.06
(elem
0.06
neck
0.06
promo
0.06
agenda
0.06
adolescente
0.06
GMC
0.06
Neck
0.06
Activations Density 0.034%