INDEX
Explanations
references to divine authority and prophecy
Neuron Alignment
Index    Value    % of L₁
50       +0.25    0.8%
453      +0.12    0.4%
1343     +0.10    0.3%
Correlated Neurons
Index    P. Corr.    Cos Sim.
1446     +0.25       0.06
337      +0.12       0.04
185      +0.10       0.05
Negative Logits
<bos>     -2.10
الحياه    -0.78
,         -0.77
public    -0.74
or        -0.73
in        -0.72
Tē        -0.71
/*        -0.71
am        -0.70
,         -0.70
Positive Logits
affor        2.14
volunte      2.06
accla        2.03
michelin     1.97
reluct       1.97
sappi        1.92
swarovski    1.91
increa       1.88
philanth     1.87
wherea       1.87
Activations Density 0.208%