INDEX
Explanations
references to legal proceedings or police investigations
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
874
+0.17
0.8%
1573
+0.17
0.8%
1961
+0.16
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
981
+0.17
0.04
732
+0.17
0.03
1177
+0.16
0.02
Negative Logits
mī
-0.73
izvē
-0.57
philosophic
-0.56
UnusedPrivate
-0.54
vī
-0.53
Marquette
-0.52
vairāk
-0.52
ferric
-0.52
Doran
-0.51
liberality
-0.48
POSITIVE LOGITS
Watson
1.27
Watson
1.13
Holmes
0.98
WATSON
0.93
Sherlock
0.89
Holmes
0.88
Sherlock
0.79
IBM
0.72
IBM
0.68
Conan
0.64
Activations Density 0.265%