INDEX
Explanations
laws and legal references
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1961
+0.13
0.4%
1565
+0.12
0.4%
1331
+0.11
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1105
+0.13
0.03
1499
+0.12
0.03
1585
+0.11
0.02
Negative Logits
aen
-1.25
ibi
-1.07
meis
-1.07
fta
-1.07
fte
-1.04
magis
-1.03
mef
-1.02
fep
-1.00
ftu
-1.00
sii
-1.00
POSITIVE LOGITS
Act
1.22
Act
1.10
act
0.99
acts
0.88
act
0.87
Acts
0.84
Acts
0.79
acted
0.79
ACT
0.79
ACT
0.70
Activations Density 0.062%