INDEX
Explanations
references to the concept of "doubling" or "double."
Neuron Alignment
Index   Value   % of L₁
307     +0.13   0.7%
196     +0.12   0.7%
420     +0.12   0.7%
Correlated Neurons
Index   P. Corr.   Cos Sim.
196     +0.13      0.02
32      +0.12      0.02
275     +0.12      0.01
Negative Logits
INE            -1.64
ĵ              -1.61
ĸ              -1.61
NOT            -1.57
using          -1.55
frastructure   -1.54
rapeut         -1.48
IRE            -1.48
astro          -1.47
unnumbered     -1.47
Positive Logits
heit           2.07
uple           1.94
sized          1.89
horn           1.86
entend         1.86
digits         1.72
dice           1.56
bow            1.56
jeopardy       1.55
ts             1.48
Activations Density: 0.093%