INDEX
Explanations
phrases related to balancing goals and resources
Neuron Alignment
Index    Value    % of L₁
50       +0.25    0.9%
2034     +0.16    0.6%
1013     +0.12    0.4%
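A minimal sketch of how a "% of L₁" column like the one above could be computed, assuming it means each weight's absolute value divided by the L₁ norm (sum of absolute values) of the whole weight vector. The page does not state this definition, and the function name `l1_share` and the toy vector are hypothetical.

```python
def l1_share(weights, top_k=3):
    """Return (index, value, share of L1 norm) for the top_k largest-magnitude weights.

    Assumption: "% of L1" = |w_i| / sum_j |w_j|. This is an illustrative
    reading of the column header, not a definition taken from the page.
    """
    l1 = sum(abs(w) for w in weights)
    if l1 == 0:
        return []
    # Rank indices by absolute weight, largest first.
    ranked = sorted(range(len(weights)), key=lambda i: abs(weights[i]), reverse=True)
    return [(i, weights[i], abs(weights[i]) / l1) for i in ranked[:top_k]]


# Toy vector (hypothetical data); its L1 norm is exactly 1.0.
toy = [0.5, -0.25, 0.125, 0.125]
rows = l1_share(toy, top_k=2)
for index, value, share in rows:
    print(f"{index:<8}{value:+.2f}    {share:.1%}")
```

Under this reading, the first row above (value +0.25 at 0.9%) would imply an L₁ norm of roughly 28 for the full vector.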
Correlated Neurons
Index    P. Corr.    Cos Sim.
284      +0.25       0.10
1013     +0.16       0.13
2034     +0.12       0.12
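The two columns above, Pearson correlation and cosine similarity, are related but distinct measures. The sketch below shows the difference on toy data: Pearson correlation is the cosine similarity of the mean-centered vectors. Which vectors the dashboard actually compares is not stated on this page, so the inputs and function names here are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def pearson_correlation(a, b):
    """Pearson correlation: cosine similarity after subtracting each mean."""
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    return cosine_similarity([x - mean_a for x in a], [y - mean_b for y in b])


# Toy vectors (hypothetical data): perfectly anti-correlated,
# yet pointing in broadly the same direction.
a = [1.0, 2.0, 3.0]
b = [3.0, 2.0, 1.0]
print(round(pearson_correlation(a, b), 3))
print(round(cosine_similarity(a, b), 3))
```

Because centering removes any shared offset, the two measures can disagree substantially, which is why a table may list both.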
Negative Logits
<bos>       -2.41
//*/        -0.97
/***        -0.80
ඉ           -0.71
Kontrola    -0.71
ඒ           -0.69
ඔ           -0.69
Vaata       -0.67
posób       -0.67
උ           -0.66
Positive Logits
lele           1.00
optik          0.90
bandung        0.90
kaos           0.88
benzina        0.85
Glou           0.85
heyd           0.85
seoul          0.84
NDEBUG         0.84
actionTypes    0.84
Activations Density: 1.227%