INDEX
Explanations
conditions
The neuron detects conditional/hypothetical triggers (e.g. “if,” “when”) that introduce “what happens” style questions.
New Auto-Interp
Negative Logits
يا
-0.06
вел
-0.06
">↵↵↵
-0.06
gnore
-0.06
冬
-0.06
296
-0.06
tuyệt
-0.06
궁
-0.06
ERSIST
-0.06
さら
-0.06
POSITIVE LOGITS
Moses
0.08
pr
0.07
(acc
0.06
shortest
0.06
امروز
0.06
Received
0.06
Moder
0.06
[Z
0.06
Yesterday
0.06
"/>
0.06
Activations Density 0.033%