INDEX
Explanations
This neuron activates on the token “If,” effectively detecting conditional statements or clauses.
New Auto-Interp
Negative Logits
battery
-0.07
flake
-0.06
drink
-0.06
biology
-0.06
итель
-0.06
IENCE
-0.06
delimiter
-0.06
_pin
-0.06
люч
-0.06
diamonds
-0.06
POSITIVE LOGITS
experimentation
0.07
فن
0.06
Пари
0.06
.";
0.06
!");
0.06
(dat
0.06
UINT
0.06
ek
0.06
asn
0.06
";}↵
0.06
Activations Density 0.017%