INDEX
Explanations
Biblical references and theological concepts
Neuron Alignment
Index    Value    % of L₁
1741     +0.11    0.3%
1380     +0.11    0.3%
924      +0.10    0.3%
Correlated Neurons
Index    P. Corr.    Cos Sim.
1786     +0.11       0.01
499      +0.11       0.02
1612     +0.10       0.01
Negative Logits
Shakspeare    -0.91
nutella       -0.90
swarovski     -0.90
fluo          -0.84
Abbé          -0.84
tupperware    -0.81
murano        -0.80
ingrat        -0.77
Shaksp        -0.77
disgra        -0.77
Positive Logits
iastes             0.59
<bos>              0.58
Kapitel            0.52
Furcht             0.52
PhysRevD           0.51
AttributeSet       0.51
lossians           0.50
verifyException    0.49
BeginContext       0.48
NSCoder            0.47
Activations Density: 0.066%