INDEX
Explanations
reference to intensity levels in various contexts
Neuron Alignment
Index   Value   % of L₁
156     +0.18   1.1%
376     +0.14   0.8%
497     +0.13   0.8%
Correlated Neurons
Index   P. Corr.   Cos Sim.
116     +0.18      0.01
497     +0.14      0.01
179     +0.13      0.01
Negative Logits
ções          -1.69
fortune       -1.58
happier       -1.55
Ĥ             -1.49
±             -1.46
abandon       -1.44
¼             -1.44
conject       -1.44
ção           -1.43
unconscious   -1.42
Positive Logits
ily      1.72
frame    1.72
chart    1.67
charts   1.66
RT       1.58
isp      1.58
uly      1.55
ICU      1.54
IMAGE    1.51
CT       1.50
Activations Density 0.006%