INDEX
Explanations
instructions or code snippets relating to software development
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
876
+0.18
0.6%
381
+0.16
0.5%
453
+0.13
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1415
+0.18
0.09
1919
+0.16
0.07
1334
+0.13
0.05
Negative Logits
Meksi
-1.06
Româ
-1.06
parlamento
-1.02
Valentín
-0.99
rafra
-0.98
divertimento
-0.98
Lég
-0.97
Souha
-0.96
soigne
-0.95
demokra
-0.94
POSITIVE LOGITS
use
0.76
rechange
0.75
add
0.71
remove
0.70
adjust
0.70
apply
0.69
replace
0.69
modify
0.68
utilize
0.67
combine
0.66
Activations Density 0.417%