INDEX
Explanations
closing parentheses and associated formatting elements in code snippets
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
423
+0.13
0.7%
320
+0.11
0.6%
478
+0.10
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
105
+0.13
0.07
431
+0.11
0.06
374
+0.10
0.05
Negative Logits
ichi
-1.70
streets
-1.48
costs
-1.44
alike
-1.44
izo
-1.44
criticism
-1.42
ji
-1.40
escence
-1.39
ocene
-1.37
ise
-1.37
POSITIVE LOGITS
¢
4.29
Īĺ
4.29
°
3.90
¯
3.87
¦
3.85
¿
3.80
¾
3.77
¿½
3.67
¬
3.64
·¸
3.54
Activations Density 0.771%