INDEX
Explanations
references to historical events and geographical locations
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
599
+0.31
1.2%
964
+0.27
1.1%
764
+0.17
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
599
+0.31
0.06
964
+0.27
0.03
1804
+0.17
0.03
Negative Logits
ImageBackground
-0.45
shadowOffset
-0.45
fillText
-0.44
Winaray
-0.43
IsNotEmpty
-0.42
btnClose
-0.41
btnBack
-0.41
subfigure
-0.41
XmlEnum
-0.40
btnDelete
-0.39
POSITIVE LOGITS
ighborhood
0.58
Allegretto
0.58
ivelany
0.56
izational
0.55
»>
0.54
ftu
0.54
congr
0.53
blackpink
0.53
dafx
0.53
ghijkl
0.53
Activations Density 0.241%