INDEX
Explanations
mentions of uncertainties or speculative events
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1133
+0.10
0.3%
1103
+0.10
0.3%
1265
+0.09
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1265
+0.10
0.04
1435
+0.10
0.05
490
+0.09
0.03
Negative Logits
embra
-1.26
emphat
-1.16
accla
-1.08
Simult
-1.04
sputnik
-1.04
compen
-1.04
maneu
-1.04
immen
-1.04
applau
-1.02
pessi
-1.02
POSITIVE LOGITS
might
1.05
may
1.03
maybe
0.98
could
0.92
possibly
0.91
maybe
0.91
perhaps
0.89
might
0.89
may
0.84
Maybe
0.82
Activations Density 0.400%