INDEX
Explanations
This neuron activates on the “preced” substring in legal headings—i.e. it detects occurrences of words like “precedential” (often in “nonprecedential”).
New Auto-Interp
Negative Logits
-0.06
/../
-0.06
.show
-0.06
.tile
-0.06
tarn
-0.06
ketøy
-0.06
misogyn
-0.06
все
-0.06
Deng
-0.06
tile
-0.06
POSITIVE LOGITS
заход
0.07
_sm
0.07
Şubat
0.07
roll
0.07
лаз
0.06
—he
0.06
तब
0.06
ografie
0.06
Perez
0.06
adb
0.06
Activations Density 0.000%