INDEX
Explanations
periods and significant pauses in sentences
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
258
+0.19
1.1%
23
+0.16
0.9%
369
+0.15
0.9%
Correlated Neurons
Index
P. Corr.
Cos Sim.
258
+0.19
0.13
189
+0.16
0.09
23
+0.15
0.10
Negative Logits
enstein
-1.68
aspect
-1.66
enium
-1.64
aan
-1.63
"}](#
-1.61
etc
-1.57
dimension
-1.57
gor
-1.54
eries
-1.49
ocene
-1.49
POSITIVE LOGITS
Reuters
2.07
MPs
2.02
lawmakers
1.87
Ī
1.86
Researchers
1.80
Conservatives
1.79
§
1.79
AFP
1.75
»¿
1.75
ĩ
1.74
Activations Density 0.946%