INDEX
Explanations
the repeated occurrence of the number 112
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
369
+0.17
0.9%
412
+0.13
0.8%
307
+0.12
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
307
+0.17
0.02
445
+0.13
0.01
44
+0.12
0.02
Negative Logits
chers
-1.90
fame
-1.84
mean
-1.75
whom
-1.72
priori
-1.43
significance
-1.42
acles
-1.38
rition
-1.38
novelist
-1.35
ributors
-1.33
POSITIVE LOGITS
¶
1.99
¬
1.87
³
1.79
¹
1.77
º
1.74
ª
1.72
®
1.66
hart
1.63
´
1.62
·
1.61
Activations Density 0.017%