Explanations
This neuron detects the standard summary-order disclaimer phrasing (e.g. “Rulings by summary order do not have precedential effect”) used in court documents.
Negative Logits
stripes     -0.07
ooo         -0.07
thread      -0.07
tol         -0.06
ulators     -0.06
।           -0.06
�           -0.06
.(*         -0.06
pity        -0.06
wax         -0.06
Positive Logits
ueba        0.07
mają        0.07
nombre      0.07
země        0.07
setError    0.07
Helping     0.06
اجع         0.06
۱۹۸         0.06
íše         0.06
ระบ         0.06
Activations Density 0.001%