INDEX
Explanations
references to information and data structures, particularly dictionary-like terms
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
410
+0.15
0.9%
434
+0.14
0.8%
158
+0.12
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
300
+0.15
0.03
503
+0.14
0.01
379
+0.12
0.02
Negative Logits
particular
-1.60
careful
-1.59
*](#
-1.52
'?"
-1.50
"){-1.47
earth
-1.44
ocrine
-1.42
grades
-1.42
continuous
-1.40
*_
-1.39
POSITIVE LOGITS
ŀ
3.75
Ń
3.52
Ĵ
3.37
£
3.35
ĩ
3.28
IJ
3.25
İ
3.24
¥
3.24
ģ
3.19
Ŀ
3.17
Activations Density 0.191%