INDEX
Explanations
words related to data structures or programming functions
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
478
+0.11
0.6%
146
+0.11
0.6%
483
+0.10
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
483
+0.11
0.16
146
+0.11
0.10
219
+0.10
0.11
Negative Logits
ļ
-1.95
į
-1.83
§
-1.75
ł
-1.67
ĨĴ
-1.63
¦
-1.60
Ļª
-1.54
Ĵ
-1.50
Ĭ
-1.48
Ģ
-1.47
POSITIVE LOGITS
gage
1.67
aintiff
1.52
WHETHER
1.44
ynes
1.43
.).
1.42
sells
1.42
//!
1.40
).](
1.39
cas
1.38
essen
1.38
Activations Density 6.228%