INDEX
Explanations
instructions and tips related to software development processes, particularly in the context of observing, delaying action, and understanding context before making changes
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1870
+0.11
0.3%
2036
+0.09
0.3%
147
+0.08
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
955
+0.11
0.06
2036
+0.09
0.03
1467
+0.08
0.04
Negative Logits
ConstraintMaker
-0.66
&___
-0.61
autunno
-0.60
migli
-0.58
Seeder
-0.55
مرئيه
-0.54
récon
-0.54
alun
-0.54
pensi
-0.52
writeFieldEnd
-0.51
POSITIVE LOGITS
apprehen
1.13
pamph
1.07
gaily
0.99
unspeak
0.98
vainly
0.94
disagre
0.94
tolerably
0.91
reluct
0.90
intersper
0.88
endeavouring
0.88
Activations Density 0.453%