INDEX
Explanations
names of categories or labels within a system
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1343
+0.13
0.4%
453
+0.12
0.3%
1978
+0.11
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1343
+0.13
0.02
1108
+0.12
0.02
453
+0.11
0.02
Negative Logits
jsPsych
-0.70
mergeFrom
-0.70
contentLoaded
-0.67
JspWriter
-0.65
betweenstory
-0.64
complexContent
-0.63
ցված
-0.63
Bibliograf
-0.61
'{@-0.60
;%%
-0.60
POSITIVE LOGITS
squa
1.70
unden
1.69
strick
1.67
inev
1.66
stockholm
1.65
?...
1.64
increa
1.63
volunte
1.61
affor
1.61
compen
1.61
Activations Density 0.032%