INDEX
Explanations
countries and regions
Neuron Alignment
Index   Value   % of L₁
2034    +0.16   0.5%
1013    +0.10   0.3%
227     +0.10   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
342     +0.16      0.05
1438    +0.10      0.04
1935    +0.10      0.05
Negative Logits
fte       -1.13
fta       -1.11
ftu       -1.11
fep       -1.03
mef       -1.03
perfon    -1.03
tew       -1.02
aen       -1.01
„,        -0.99
ftre      -0.97
Positive Logits
with            0.66
whose           0.65
capable         0.57
that            0.56
without         0.55
ardı            0.52
internalType    0.51
.               0.51
ophenol         0.51
WITH            0.50
Activations Density 0.347%