INDEX
Explanations
references to legislation, organizations, and technical influences such as operating systems throughout the text
Neuron Alignment
Index   Value   % of L₁
1741    +0.22   0.8%
50      +0.19   0.7%
2019    +0.15   0.5%
Correlated Neurons
Index   Pearson Corr.   Cosine Sim.
16      +0.22           0.07
50      +0.19           0.05
1967    +0.15           0.04
Negative Logits
vété         -1.02
marchand     -0.91
jouet        -0.91
noël         -0.90
malheureux   -0.89
nuage        -0.89
couteau      -0.88
oeil         -0.87
géant        -0.86
tramonto     -0.85
Positive Logits
ideolog       0.88
solidar       0.79
amount        0.78
astéro        0.71
maig          0.69
reputa        0.67
minuta        0.65
possibility   0.64
extent        0.62
tenden        0.61
Activations Density 0.387%