INDEX
Explanations
questions that can be answered with numeric values or related calculations
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
478
+0.14
0.8%
8
+0.11
0.6%
115
+0.10
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
8
+0.14
0.01
357
+0.11
0.01
367
+0.10
0.01
Negative Logits
listed
-1.77
quarters
-1.53
futures
-1.53
tests
-1.52
caster
-1.51
boards
-1.50
through
-1.46
papers
-1.46
notes
-1.45
notes
-1.44
POSITIVE LOGITS
»
2.00
ĩ
1.76
IJ
1.75
¯
1.74
·
1.73
±
1.62
´
1.62
Ł
1.59
ĥ½
1.58
¹
1.56
Activations Density 0.025%