INDEX
Explanations
terms related to decentralization, specific organizations or entities, and terms related to specific industries or actions
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1343
+0.16
0.5%
394
+0.13
0.4%
1842
+0.12
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
507
+0.16
0.06
523
+0.13
0.04
1510
+0.12
0.05
Negative Logits
<bos>
-0.92
and
-0.78
и
-0.72
や
-0.70
-0.69
và
-0.68
.
-0.67
和
-0.67
と
-0.66
(
-0.66
POSITIVE LOGITS
effe
2.14
meis
2.10
inder
2.06
dises
2.02
„,
1.99
aen
1.99
kram
1.98
mef
1.98
fta
1.94
abnorm
1.94
Activations Density 0.552%