INDEX
Explanations
court sentences
This neuron activates on legal sentencing details, especially numeric prison‐term mentions and consecutive‐serving language.
New Auto-Interp
Negative Logits
cmap
-0.06
ソ
-0.06
408
-0.06
_SA
-0.06
_pattern
-0.06
'][
-0.06
Andrews
-0.06
blends
-0.06
rema
-0.06
ball
-0.06
POSITIVE LOGITS
khởi
0.08
характеристи
0.08
Measured
0.07
kad
0.06
NAMESPACE
0.06
heavenly
0.06
elek
0.06
progressively
0.06
порушення
0.06
",");↵
0.06
Activations Density 0.001%