INDEX
Explanations
The neuron activates on tokens in U.S. Supreme Court reporter citation strings (e.g. “S.Ct”).
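To make the kind of string in question concrete, here is a minimal sketch of a pattern for Supreme Court Reporter citations of the form "volume S.Ct. page". The regular expression and the helper name `find_sct_citations` are assumptions for illustration only. They do not come from this page and say nothing about how the neuron itself computes its activation.

```python
import re

# Hypothetical pattern for citations such as "93 S.Ct. 705" or "93 S. Ct. 705".
# This is an illustrative assumption, not something derived from the neuron.
SCT_CITATION = re.compile(r"\b(\d+)\s+S\.\s?Ct\.?\s+(\d+)\b")


def find_sct_citations(text: str) -> list[tuple[int, int]]:
    """Return (volume, page) pairs for each S.Ct citation found in text."""
    return [(int(vol), int(page)) for vol, page in SCT_CITATION.findall(text)]


sample = "See Roe v. Wade, 93 S.Ct. 705 (1973); cf. 410 U.S. 113."
print(find_sct_citations(sample))  # [(93, 705)]
```

The "410 U.S. 113" citation in the sample is deliberately not matched, since it refers to a different reporter (United States Reports).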
Negative Logits
(events      -0.07
через        -0.07
찰           -0.07
(path        -0.07
もっと       -0.06
paralleled   -0.06
cl           -0.06
ign          -0.06
attribute    -0.06
خش           -0.06
Positive Logits
961          0.06
Tax          0.06
(blank)      0.06
udging       0.06
trustee      0.06
só           0.06
oğ           0.06
.Ct          0.06
Север        0.06
correctness  0.06
Activations Density 0.000%