INDEX
Explanations
references to bibliographic information and licensing details
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
156
+0.28
1.6%
181
+0.12
0.7%
432
+0.11
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
173
+0.28
0.04
10
+0.12
0.04
474
+0.11
0.03
Negative Logits
º
-4.06
¬
-3.89
»¿
-3.80
į
-3.80
Į
-3.72
¶
-3.61
½
-3.61
ī
-3.57
Ħ
-3.54
ĥ½
-3.54
POSITIVE LOGITS
tons
1.52
claimants
1.50
remind
1.48
otherwise
1.43
witness
1.41
encounter
1.36
reward
1.34
supply
1.34
agree
1.33
pair
1.33
Activations Density 0.570%