INDEX
Explanations
technical details and descriptions related to tools or procedures
Neuron Alignment
Index   Value   % of L₁
876     +0.14   0.4%
1531    +0.10   0.3%
604     +0.09   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
2044    +0.14      0.05
1531    +0.10      0.04
1791    +0.09      0.03
Negative Logits
<bos>      -0.92
Öster      -0.83
maksi      -0.81
Keny       -0.79
lü         -0.79
keramik    -0.78
uhr        -0.77
territo    -0.74
panik      -0.73
Heeren     -0.71
Positive Logits
requires        0.68
require         0.66
disadvantages   0.65
disadvantage    0.64
Lmfao           0.60
Requires        0.59
drawback        0.59
tricot          0.59
sightly         0.59
expensive       0.58
Activations Density 0.384%