INDEX
Explanations
violent and criminal actions or events
Neuron Alignment
Index   Value   % of L₁
1842    +0.12   0.3%
1177    +0.09   0.3%
946     +0.07   0.2%
Correlated Neurons
Index   P. Corr.   Cos Sim.
946     +0.12      0.06
1487    +0.09      0.06
100     +0.07      0.05
Negative Logits
nemia            -0.64
quarelle         -0.60
Zwiebel          -0.55
>\<              -0.53
Solubility       -0.53
Fieber           -0.53
etui             -0.53
NotImplemented   -0.52
meninos          -0.52
nè               -0.51
Positive Logits
<bos>            0.76
apprehen         0.76
McLaugh          0.75
reconno          0.73
gaily            0.72
Juf              0.71
homeward         0.69
Gorb             0.69
Bartholo         0.69
unspeak          0.68
Activations Density: 0.280%