INDEX
Explanations
terms related to economic and governmental systems
Neuron Alignment
Index   Value   % of L₁
1705    +0.16   0.5%
289     +0.09   0.3%
36      +0.07   0.2%
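The "% of L₁" column is not defined on this page. A minimal sketch of one plausible reading, assuming it is the absolute value of a single weight divided by the L₁ norm (sum of absolute values) of the whole weight vector, expressed as a percentage. The function name `l1_share` and the toy vector are hypothetical and are not taken from the dashboard.

```python
def l1_share(weights, index):
    """Return |weights[index]| as a percentage of the vector's L1 norm.

    Assumption: "% of L1" means one entry's share of the sum of
    absolute values. The dashboard itself does not state this.
    """
    l1_norm = sum(abs(w) for w in weights)
    if l1_norm == 0:
        raise ValueError("L1 norm is zero; share is undefined")
    return 100.0 * abs(weights[index]) / l1_norm


# Hypothetical toy vector for illustration only, not the real feature weights.
toy_weights = [0.16, -0.09, 0.07, 0.68]
share = l1_share(toy_weights, 0)
print(f"{share:.1f}% of L1")
```

Under this reading, a value of +0.16 at 0.5% would imply an L₁ norm of roughly 32 for the full vector, which is consistent with the other two rows once rounding is taken into account.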
Correlated Neurons
Index   Pearson Corr.   Cosine Sim.
1705    +0.16           0.08
289     +0.09           0.02
169     +0.07           0.03
Negative Logits
Token       Logit
vœ          -0.63
lizabeth    -0.62
disagre     -0.62
uests       -0.61
ceff        -0.61
reluct      -0.59
burton      -0.58
malheur     -0.58
intersper   -0.57
prendra     -0.56
Positive Logits
Token        Logit
mechanism    0.91
mechanisms   0.88
system       0.84
mechanism    0.79
system       0.74
systems      0.74
Mechanism    0.69
sistem       0.69
Mechanisms   0.68
ystem        0.66
Activations Density 0.449%