INDEX
Explanations
occurrences of commas and periods in the text
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
74
+0.11
0.6%
349
+0.11
0.6%
186
+0.10
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
349
+0.11
0.02
74
+0.11
0.02
253
+0.10
0.02
Negative Logits
ľĵ
-1.74
,&
-1.62
onica
-1.59
(),
-1.50
»¿
-1.47
↵
-1.47
č↵
-1.47
-1.47
<|outofrange|>
-1.47
↵
-1.47
POSITIVE LOGITS
sible
1.59
Ãħ
1.41
]{.1.32
disabling
1.25
Corps
1.25
âĸ
1.21
ius
1.20
NESS
1.20
sein
1.19
WARRANTIES
1.18
Activations Density 0.052%