INDEX
Explanations
The neuron detects mentions of Islamic sectarian identifiers—particularly “Sunni” and “Shia.”
New Auto-Interp
Negative Logits
عداد
-0.07
broadcasters
-0.07
'Brien
-0.06
бург
-0.06
Norm
-0.06
okul
-0.06
پیش
-0.06
آس
-0.06
|\
-0.06
woven
-0.06
POSITIVE LOGITS
-wh
0.07
it
0.06
.ct
0.06
cl
0.06
захворювання
0.06
di
0.06
(ex
0.06
Cl
0.06
recep
0.06
іє
0.06
Activations Density 0.001%