INDEX
Explanations
terms related to technology and data security
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
872
+0.10
0.3%
604
+0.09
0.2%
736
+0.09
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
455
+0.10
0.02
1501
+0.09
0.04
707
+0.09
0.04
Negative Logits
mef
-1.54
fluo
-1.47
wien
-1.47
Juf
-1.42
Keny
-1.41
franz
-1.40
bordeaux
-1.34
anton
-1.33
hcm
-1.32
meis
-1.29
POSITIVE LOGITS
ensures
1.02
makes
1.01
allows
0.95
enables
0.93
gives
0.92
indicates
0.88
helps
0.87
creates
0.85
provides
0.85
suggests
0.85
Activations Density 0.317%