INDEX
Explanations
This neuron detects U.S. district‐court jurisdiction names (i.e. the state/region labels in the court heading).
New Auto-Interp
Negative Logits
lent
-0.07
Caldwell
-0.06
젠
-0.06
(cap
-0.06
Calder
-0.06
Archer
-0.06
aku
-0.06
цеп
-0.06
OBS
-0.06
tracker
-0.06
POSITIVE LOGITS
////
0.07
默认
0.07
OpenGL
0.06
ustanov
0.06
ltre
0.06
_insert
0.06
년
0.06
февраля
0.06
Scotland
0.06
atau
0.06
Activations Density 0.002%