INDEX
Explanations
references to shooting incidents
Neuron Alignment
Index   Value   % of L₁
358     +0.17   1.0%
164     +0.12   0.7%
490     +0.11   0.6%
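The "% of L₁" column appears to give each neuron's share of the L₁ norm of the full alignment vector; the three rows above are consistent with an L₁ norm of roughly 17. A minimal sketch of that calculation, assuming this reading is correct. The function name `l1_share` and the toy vector are hypothetical and are not the feature's real weights:

```python
import numpy as np

def l1_share(weights: np.ndarray, k: int = 3):
    """Return the k largest-magnitude entries as (index, value, share of L1 norm)."""
    l1 = np.abs(weights).sum()                 # L1 norm of the whole vector
    top = np.argsort(-np.abs(weights))[:k]     # indices sorted by descending |value|
    return [(int(i), float(weights[i]), float(abs(weights[i]) / l1)) for i in top]

# Hypothetical toy vector (assumption for illustration only).
w = np.array([0.5, -0.25, 0.125, 0.125])
rows = l1_share(w, k=2)
for idx, value, share in rows:
    print(f"{idx:<6}{value:+.2f}   {share:.1%}")
```

Small shares such as 1.0% would indicate that the feature's weight is spread across many neurons rather than concentrated in a few.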
Correlated Neurons
Index   Pearson Corr.   Cos Sim.
45      +0.17           0.02
358     +0.12           0.03
134     +0.11           0.02
Negative Logits
ľĵ      -3.93
ŀ       -3.81
ļ       -3.72
ĭ       -3.71
ķ       -3.68
¶       -3.63
§       -3.63
ĥ½      -3.58
°       -3.56
·¸      -3.55
Positive Logits
guns    3.28
gun     2.46
oslav   2.00
shots   1.91
nikov   1.84
outs    1.73
ogue    1.66
fired   1.66
Vict    1.60
blast   1.58
Activations Density 0.237%