INDEX
Explanations
references to organizations and institutions involved in advocacy or legislative matters
Neuron Alignment
Index  Value  % of L₁
457    +0.15  0.8%
160    +0.13  0.7%
410    +0.12  0.7%
Correlated Neurons
Index  P. Corr.  Cos Sim.
457    +0.15     0.06
440    +0.13     0.05
160    +0.12     0.05
Negative Logits
mann           -1.64
imer           -1.50
icular         -1.49
access         -1.48
sense          -1.44
indication     -1.44
InstanceState  -1.41
chart          -1.40
ier            -1.38
limit          -1.38
Positive Logits
IJ              4.40
¿½              4.39
ľĵ              4.36
»¿              4.25
½               4.22
↵               4.21
<|outofrange|>  4.21
↵               4.21
<|outofrange|>  4.21
↵ âĢĥ           4.21
Activations Density: 0.370%