INDEX
Explanations
proper nouns or specific terms, potentially related to products or locations
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
866
+0.09
0.3%
228
+0.08
0.2%
1964
+0.07
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1343
+0.09
0.03
1550
+0.08
0.02
1066
+0.07
0.02
Negative Logits
Ka
-0.60
otheby
-0.56
criptures
-0.55
ColumnHeaders
-0.53
icznego
-0.51
tangentMode
-0.51
defaultstate
-0.50
underland
-0.50
emissions
-0.49
IGraphics
-0.49
POSITIVE LOGITS
ka
1.94
KA
1.29
ftu
1.21
sovere
1.18
stockholm
1.18
tranf
1.16
reft
1.14
sappi
1.11
perfon
1.11
nece
1.10
Activations Density 0.191%