INDEX
Explanations
instances related to financial aspects and organizations
Neuron Alignment
Index    Value    % of L₁
1187     +0.15    0.6%
874      +0.12    0.5%
31       +0.12    0.5%
Correlated Neurons
Index    P. Corr.    Cos Sim.
1187     +0.15       0.02
629      +0.12       0.02
874      +0.12       0.02
Negative Logits
effe       -0.80
inev       -0.80
encomp     -0.78
overla     -0.77
disagre    -0.77
maer       -0.75
maneu      -0.74
unve       -0.74
suscep     -0.72
depic      -0.72
Positive Logits
foundation     1.61
Foundation     1.59
Foundation     1.46
foundation     1.43
foundations    1.42
Foundations    1.26
FOUNDATION     1.26
Foundations    1.00
FOUND          0.79
Stiftung       0.76
Activations Density: 0.091%