INDEX
Explanations
references to procedural or regulatory steps in contexts such as legal or construction processes
Neuron Alignment
Index   Value   % of L₁
156     +0.74   6.0%
23      +0.21   1.7%
478     +0.11   0.9%
Correlated Neurons
Index   P. Corr.   Cos Sim.
23      +0.74      0.20
478     +0.21      -0.02
240     +0.11      -0.05
Negative Logits
аÑħ      -1.58
:`       -1.50
à½       -1.46
ி        -1.45
à¯       -1.44
à®ķ      -1.43
ressor   -1.40
Õ¥       -1.39
à²       -1.38
á̏        -1.38
Positive Logits
respect   1.43
blunt     1.23
how       1.22
basic     1.20
another   1.20
amounts   1.18
gaug      1.18
ching     1.17
precise   1.16
suppose   1.16
Activations Density 4.359%