INDEX
Explanations
words related to algorithms, mathematical concepts, or computer programming
Neuron Alignment
Index    Value    % of L₁
893      +0.20    1.0%
67       +0.13    0.6%
889      +0.12    0.6%
Correlated Neurons
Index    P. Corr.    Cos Sim.
893      +0.20       0.03
732      +0.13       0.02
67       +0.12       0.02
Negative Logits
AssemblyCulture    -0.49
Claudia            -0.49
Claudia            -0.47
HasIndex           -0.47
קישורים            -0.46
Pá                 -0.45
twimg              -0.45
HideInInspector    -0.45
Abril              -0.44
SizeMode           -0.44
Positive Logits
Fer         1.37
Fer         1.29
FER         1.26
fer         1.13
ferret      1.07
Fergus      1.06
fer         1.04
Ferrell     1.01
FERR        1.00
Ferguson    0.99
Activations Density: 0.120%