INDEX
Explanations
references to financial figures and costs
Neuron Alignment
Index    Value    % of L₁
1334     +0.18    0.6%
674      +0.17    0.6%
1896     +0.12    0.4%
Correlated Neurons
Index    P. Corr.    Cos Sim.
1334     +0.18       0.13
920      +0.17       0.07
161      +0.12       0.07
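The two columns above, P. Corr. and Cos Sim., can differ for the same neuron because Pearson correlation is the cosine similarity of the mean-centered vectors. The following is a minimal sketch of the two standard definitions in plain Python. It assumes "P. Corr." means Pearson correlation and "Cos Sim." means cosine similarity; the page does not state which vectors it compares, so the toy vectors `feature_acts` and `neuron_acts` are hypothetical and are not taken from the dashboard.

```python
import math


def cosine_similarity(x, y):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(x, y))
    norm_x = math.sqrt(sum(a * a for a in x))
    norm_y = math.sqrt(sum(b * b for b in y))
    return dot / (norm_x * norm_y)


def pearson_correlation(x, y):
    """Pearson correlation: cosine similarity after subtracting each mean."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    centered_x = [a - mean_x for a in x]
    centered_y = [b - mean_y for b in y]
    return cosine_similarity(centered_x, centered_y)


# Hypothetical toy data (an assumption for illustration only).
feature_acts = [0.0, 0.0, 1.0, 0.0, 2.0, 0.0]
neuron_acts = [0.5, 0.4, 0.9, 0.6, 1.3, 0.5]

print("Pearson:", round(pearson_correlation(feature_acts, neuron_acts), 2))
print("Cosine: ", round(cosine_similarity(feature_acts, neuron_acts), 2))
```

Because centering removes any constant offset, a neuron with a large baseline activation can show a high cosine similarity but a modest Pearson correlation, or the reverse.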
Negative Logits
Token         Logit
antity        -0.76
haviour       -0.70
esterday      -0.69
cember        -0.68
phalt         -0.68
intenance     -0.66
„,            -0.66
uestions      -0.65
iented        -0.65
Cæsar         -0.65
Positive Logits
Token         Logit
Ainda         0.60
achieve       0.59
ensure        0.59
prevent       0.58
avoid         0.56
wieś          0.56
achieve       0.55
Talvez        0.54
comply        0.54
facilitate    0.54
Activations Density: 0.527%