INDEX
Explanations
information related to technical document formatting and structure
Neuron Alignment
Index  Value  % of L₁
176    +0.11  0.6%
65     +0.10  0.6%
173    +0.10  0.6%
Correlated Neurons
Index  P. Corr.  Cos Sim.
438    +0.11     0.02
373    +0.10     0.02
53     +0.10     0.02
Negative Logits
/%           -1.53
Mathemat     -1.52
Planck       -1.52
TRODUCTION   -1.48
sickness     -1.41
Buddha       -1.39
likes        -1.38
Freud        -1.38
centimeters  -1.37
\]\].        -1.37
Positive Logits
ĨĴ  2.39
ķ   2.30
ĥ½  2.13
¡   2.11
Ħ   2.04
ĸ   1.87
²   1.83
§   1.81
¹   1.80
®   1.77
Activations Density 0.010%