INDEX
Explanations
instances of the word "commerce."
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
460
+0.11
0.7%
392
+0.11
0.7%
119
+0.11
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
460
+0.11
0.01
119
+0.11
0.01
19
+0.11
0.01
Negative Logits
bowed
-1.67
ij
-1.58
IJ
-1.53
crossed
-1.45
NULL
-1.44
³
-1.43
selected
-1.41
Gay
-1.41
whole
-1.40
gay
-1.37
POSITIVE LOGITS
vist
2.15
istry
2.04
naire
1.99
eer
1.92
esan
1.91
yer
1.90
fors
1.90
processor
1.85
ilage
1.82
bard
1.77
Activations Density 0.313%