INDEX
Explanations
This neuron detects references to an appeal of a court decision (i.e. occurrences of the word “appeal”).
New Auto-Interp
Negative Logits
фектив
-0.06
진
-0.06
fires
-0.06
Ramos
-0.06
amarin
-0.06
_CTRL
-0.06
нату
-0.06
Legendary
-0.06
clip
-0.06
/project
-0.06
POSITIVE LOGITS
Ut
0.07
drastically
0.07
TECHNO
0.06
clearer
0.06
جوی
0.06
.invoke
0.06
ज
0.06
(sprite
0.06
}'
0.06
>r
0.06
Activations Density 0.006%