INDEX
Explanations
names of specific strategies or departments within an organizational context
Neuron Alignment
Index   Value   % of L₁
1870    +0.15   0.6%
555     +0.15   0.6%
251     +0.14   0.6%
Correlated Neurons
Index   P. Corr.   Cos Sim.
251     +0.15      0.03
555     +0.15      0.03
1806    +0.14      0.03
Negative Logits
robus      -0.65
reger      -0.59
democ      -0.58
remplace   -0.58
libere     -0.57
pessi      -0.57
opio       -0.56
Läs        -0.55
revan      -0.55
ferait     -0.55
Positive Logits
strategy     1.38
Strategy     1.24
strategy     1.21
strategies   1.18
Strategy     1.10
Strategies   1.05
STRATEGY     1.02
strategies   0.98
égias        0.98
strategic    0.96
Activations Density: 0.064%