INDEX
Explanations
words or prefixes related to scientific or technical content
Neuron Alignment
Index   Value   % of L₁
678     +0.17   0.6%
513     +0.14   0.5%
1363    +0.13   0.4%
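A minimal sketch of how a table like Neuron Alignment could be produced, assuming that "Value" is an entry of the feature's weight vector over neurons and that "% of L₁" is that entry's absolute value divided by the vector's L₁ norm. Both readings are assumptions, not something the dashboard states. The function name `top_aligned_neurons` and the toy vector are hypothetical.

```python
def top_aligned_neurons(weights, k=3):
    """Return (index, value, share_of_l1) for the k largest entries of `weights`.

    Assumption: share_of_l1 = |value| / sum(|w| for all w), i.e. the entry's
    share of the vector's L1 norm.
    """
    l1 = sum(abs(w) for w in weights)
    # Rank neurons by signed value, largest first (sorted() is stable on ties).
    ranked = sorted(enumerate(weights), key=lambda iw: iw[1], reverse=True)
    return [(i, w, abs(w) / l1 if l1 else 0.0) for i, w in ranked[:k]]


# Toy weight vector with made-up values (not taken from the dashboard).
toy = [0.5, -0.25, 0.125, 0.0625, 0.0625]
result = top_aligned_neurons(toy, k=2)

for idx, val, share in result:
    # Same column layout as the table: Index, Value, % of L1.
    print(f"{idx:<7} {val:+.2f}   {share:.1%}")
```

Under this reading, a value of +0.17 at 0.6% would imply an L₁ norm of roughly 28 for the full vector, which is only a consistency check on the assumption, not a confirmed figure.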
Correlated Neurons
Index   Pearson Corr.   Cos Sim.
678     +0.17           0.08
1363    +0.14           0.07
1965    +0.13           0.05
Negative Logits
to           -0.68
(            -0.67
.            -0.66
and          -0.64
time         -0.64
in           -0.64
възможно     -0.64
настоящий    -0.63
by           -0.63
,            -0.63
Positive Logits
sappi        1.87
migli        1.71
abbra        1.69
incess       1.68
tramont      1.65
ritard       1.65
squa         1.64
erec         1.63
immen        1.63
embra        1.63
Activations Density 0.360%