INDEX
Explanations
references to technical information related to computer programming and architecture
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
764
+0.20
0.6%
184
+0.12
0.4%
856
+0.10
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
764
+0.20
0.05
1948
+0.12
0.05
1819
+0.10
0.06
Negative Logits
bunda
-0.80
alkoh
-0.75
palav
-0.74
lele
-0.73
keramik
-0.71
kram
-0.71
beren
-0.71
panik
-0.68
uhr
-0.68
akku
-0.66
POSITIVE LOGITS
or
0.68
Fitment
0.66
mocht
0.62
NDEBUG
0.59
gänzlich
0.57
responseData
0.56
dacht
0.55
newVal
0.55
tagName
0.54
sceptre
0.53
Activations Density 0.551%