Explanations
words related to precious metals, specifically gold
Neuron Alignment
Index   Value   % of L₁
1842    +0.11   0.3%
1385    +0.10   0.3%
1992    +0.09   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1409    +0.11      0.02
284     +0.10      0.05
54      +0.09      0.04
Negative Logits
affor        -2.23
disagre      -2.23
reluct       -2.22
increa       -2.21
shenan       -2.17
encomp       -2.17
unspeak      -2.15
intersper    -2.14
volunte      -2.08
guarante     -2.06
Positive Logits
<bos>              0.85
Portale            0.83
Clik               0.74
AssemblyCulture    0.73
sapiens            0.71
eleste             0.70
enumii             0.68
Portail            0.67
CWE                0.66
----</             0.66
Activations Density: 0.607%