INDEX
Explanations
references to significant legal consequences in a judicial context
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
434
+0.12
0.7%
254
+0.12
0.6%
84
+0.11
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
438
+0.12
0.40
53
+0.12
0.42
56
+0.11
0.11
Negative Logits
º
-1.69
"/>
-1.62
¾
-1.53
";
-1.48
¹
-1.41
aboard
-1.35
");
-1.35
).](
-1.34
descending
-1.33
").
-1.32
POSITIVE LOGITS
orate
1.63
subclass
1.54
th
1.51
ibase
1.49
ky
1.48
ovir
1.47
oufl
1.45
DTD
1.43
enance
1.42
ocardi
1.41
Activations Density 4.670%