INDEX
Explanations
words related to deployment, especially in a military context
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1778
+0.15
0.6%
1870
+0.14
0.5%
990
+0.12
0.5%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1778
+0.15
0.03
990
+0.14
0.02
240
+0.12
0.02
Negative Logits
qualifi
-0.57
haver
-0.51
Fag
-0.51
Fag
-0.51
Xna
-0.50
Schna
-0.50
Brag
-0.50
FSC
-0.48
Sic
-0.46
Paredes
-0.46
POSITIVE LOGITS
deployment
1.29
deployments
1.25
deploy
1.25
deployed
1.18
Deployment
1.16
deploying
1.12
deployment
1.08
deployed
1.07
Deploy
1.04
deploy
1.03
Activations Density 0.063%