INDEX
Explanations
references to the Atlanta Falcons
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
369
+0.19
1.1%
71
+0.13
0.7%
59
+0.12
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
59
+0.19
0.01
71
+0.13
0.01
111
+0.12
0.01
Negative Logits
ĵ
-2.33
Ŀ
-2.26
ħ
-2.21
ij
-2.21
ł
-2.17
¾
-2.15
İ
-2.13
ĥ½
-2.10
¶
-2.06
ĥ
-2.03
POSITIVE LOGITS
oon
1.86
staff
1.86
ettes
1.76
ues
1.74
arty
1.64
ue
1.63
ù
1.60
ilities
1.59
uer
1.58
well
1.51
Activations Density 0.010%