INDEX
Explanations
comparisons or thresholds indicated by numerical values or measurements
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
118
+0.12
0.7%
465
+0.12
0.7%
383
+0.12
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
70
+0.12
0.02
383
+0.12
0.02
30
+0.12
0.01
Negative Logits
ball
-1.85
éric
-1.67
neh
-1.48
...]
-1.47
untu
-1.45
iddell
-1.44
deb
-1.43
ugu
-1.40
ambled
-1.38
reich
-1.38
POSITIVE LOGITS
practical
1.48
atics
1.45
between
1.45
Ĩ
1.42
¨
1.38
ľ
1.36
catal
1.35
Ļ
1.35
¤
1.33
aging
1.33
Activations Density 0.246%