INDEX
Explanations
references to films and the film industry
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
93
+0.16
0.9%
156
+0.16
0.9%
100
+0.14
0.8%
Correlated Neurons
Index
P. Corr.
Cos Sim.
93
+0.16
0.04
100
+0.16
0.04
502
+0.14
0.03
Negative Logits
ĥ½
-2.46
Ń
-2.32
ĭ
-2.25
·
-2.24
ł
-2.24
Ĥ¬
-2.18
Į
-2.16
Ħ
-2.12
ĸ´
-2.08
«
-2.08
POSITIVE LOGITS
iary
1.94
clips
1.86
footage
1.85
ic
1.85
ico
1.79
oon
1.77
fare
1.77
iller
1.73
theaters
1.71
iem
1.69
Activations Density 0.143%