INDEX
Explanations
references to various types of buildings and locations
Neuron Alignment
Index    Value    % of L₁
156      +0.27    1.6%
56       +0.16    1.0%
465      +0.11    0.6%
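The "% of L₁" column is not defined on this page. One plausible reading, and it is only an assumption, is that each entry is the absolute weight on a neuron divided by the L₁ norm (sum of absolute values) of the whole weight vector. The helper name `l1_share` and the toy vector below are hypothetical and are not taken from this feature's data.

```python
def l1_share(weights):
    """Return each weight's absolute value as a percentage of the vector's L1 norm.

    Assumption: this mirrors what the "% of L1" column reports; the page
    itself does not state the formula.
    """
    l1_norm = sum(abs(w) for w in weights)
    if l1_norm == 0:
        raise ValueError("L1 norm is zero; shares are undefined")
    return [100.0 * abs(w) / l1_norm for w in weights]


# Toy vector for illustration only (not this feature's actual weights).
toy_weights = [0.5, -0.25, 0.25]
shares = l1_share(toy_weights)
```

Under this reading the shares across all neurons sum to 100%, so small percentages for the top three neurons would indicate a weight vector spread over many neurons.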
Correlated Neurons
Index    P. Corr.    Cos Sim.
156      +0.27       0.39
458      +0.16       0.37
71       +0.11       0.36
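Assuming "P. Corr." stands for Pearson correlation and "Cos Sim." for cosine similarity, the two columns can be compared with a minimal sketch of the standard definitions. Which vectors the page actually compares (activations over a dataset, weight directions, or something else) is not stated here, so the inputs below are toy data and the function names are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def pearson_correlation(a, b):
    """Pearson correlation: cosine similarity after mean-centering both vectors."""
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    centered_a = [x - mean_a for x in a]
    centered_b = [y - mean_b for y in b]
    return cosine_similarity(centered_a, centered_b)


# Toy inputs for illustration only (not data from this page).
feature_values = [0.0, 1.0, 2.0, 0.5]
neuron_values = [0.1, 0.9, 1.8, 0.7]
r = pearson_correlation(feature_values, neuron_values)
c = cosine_similarity(feature_values, neuron_values)
```

The two measures differ only in the mean-centering step, which is one reason they can give different values for the same pair of vectors.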
Negative Logits
ĥ½    -3.12
Ķ     -2.55
±     -2.52
¾     -2.52
¦     -2.46
Ŀ     -2.45
¤     -2.40
IJ    -2.36
º     -2.36
«     -2.36
Positive Logits
owner           2.08
yard            2.07
arrangement     1.94
fires           1.90
arrangements    1.83
system          1.81
owner           1.78
renovation      1.75
wright          1.75
holder          1.73
Activations Density 1.650%