INDEX
Explanations
information related to scientific research findings, including details about discoveries, studies, and publications
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
596
+0.12
0.4%
1967
+0.12
0.4%
453
+0.12
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
596
+0.12
0.04
1334
+0.12
0.03
369
+0.12
0.03
Negative Logits
»>
-1.13
fta
-1.11
thut
-1.09
ftu
-1.03
squa
-1.03
mef
-1.00
ftre
-1.00
„,
-0.98
Augu
-0.98
aen
-0.98
POSITIVE LOGITS
0.63
OF
0.59
approximately
0.57
Of
0.55
Até
0.53
של
0.53
sorts
0.50
of
0.49
about
0.48
DataPropertyName
0.48
Activations Density 0.178%