INDEX
Explanations
phrases related to relocation and adjustment
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
198
+0.09
0.3%
2025
+0.07
0.2%
1537
+0.07
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1118
+0.09
0.04
1962
+0.07
0.04
742
+0.07
0.02
Negative Logits
inappro
-1.05
impra
-1.03
affor
-0.99
disagre
-0.95
michel
-0.95
Cfr
-0.94
strick
-0.94
unden
-0.93
increa
-0.93
suscep
-0.92
POSITIVE LOGITS
successive
0.71
phazard
0.65
different
0.63
constantly
0.61
various
0.61
varying
0.61
depending
0.60
periods
0.57
changing
0.57
alternating
0.56
Activations Density 0.467%