Explanations
themes related to divine creation and celestial glory
Neuron Alignment
Index    Value    % of L₁
50       +0.27    0.9%
1842     +0.16    0.5%
1343     +0.12    0.4%
Correlated Neurons
Index    Pearson Corr.    Cos. Sim.
1446     +0.27            0.07
1842     +0.16            0.05
1284     +0.12            0.07
Negative Logits
<bos>    -1.61
netto    -0.69
stoff    -0.68
dè       -0.68
bronz    -0.65
anse     -0.63
vinil    -0.63
tille    -0.62
maso     -0.62
tré      -0.62
Positive Logits
unlaw        0.98
despotism    0.96
Minang       0.91
pamph        0.87
tucson       0.85
bandung      0.85
dott         0.85
îna          0.85
swarovski    0.83
uncin        0.83
Activations Density 0.670%