INDEX
Explanations
This neuron detects the causal conjunction “because.”
New Auto-Interp
Negative Logits
<n
-0.07
_pts
-0.07
11
-0.07
Split
-0.07
span
-0.07
ANTLR
-0.07
54
-0.07
std
-0.07
(tol
-0.07
ladder
-0.07
POSITIVE LOGITS
because
0.22
because
0.20
Because
0.18
Because
0.16
ecause
0.11
cuz
0.10
porque
0.10
cause
0.09
لأن
0.09
Although
0.08
Activations Density 0.045%