INDEX
Explanations
verbs related to programming and software development processes, such as creating, developing, and writing
Neuron Alignment
Index   Value   % of L₁
678     +0.14   0.4%
1984    +0.14   0.4%
468     +0.10   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
678     +0.14      0.08
911     +0.14      0.04
1531    +0.10      0.04
Negative Logits
désigne      -0.67
résulte      -0.66
contribue    -0.64
restera      -0.64
occupe       -0.64
ressemble    -0.62
devra        -0.62
apparaît     -0.59
accueille    -0.59
affirme      -0.59
Positive Logits
hairc        0.96
considér     0.95
délib        0.89
mahd         0.85
osal         0.85
semblables   0.84
pleins       0.83
écout        0.83
destinées    0.83
embodi       0.81
Activations Density: 0.542%