INDEX
Explanations
delivery-related terms and conditions
Neuron Alignment
Index   Value   % of L₁
369     +0.19   1.1%
220     +0.14   0.8%
376     +0.12   0.7%
Correlated Neurons
Index   P. Corr.   Cos Sim.
220     +0.19      0.01
111     +0.14      0.01
403     +0.12      0.01
Negative Logits
cluding      -1.79
tern         -1.70
cha          -1.62
quired       -1.60
neut         -1.56
cludes       -1.50
iginally     -1.47
dependence   -1.45
full         -1.40
lifetime     -1.39
Positive Logits
leys   1.94
oque   1.94
etto   1.90
aire   1.74
ERTY   1.68
enez   1.67
esan   1.66
ek     1.66
uxe    1.57
inux   1.56
Activations Density 0.012%