INDEX
Explanations
commands and steps related to setting up and using software or technical tools
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
453
+0.15
0.4%
1013
+0.13
0.4%
1177
+0.10
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1415
+0.15
0.04
1038
+0.13
0.04
453
+0.10
0.06
Negative Logits
fta
-1.21
ftu
-1.11
bourgeo
-1.11
increa
-1.10
fte
-1.09
alre
-1.08
quitted
-1.08
suscep
-1.07
intermitt
-1.06
bordeaux
-1.05
POSITIVE LOGITS
develop
0.83
create
0.81
design
0.80
designing
0.80
<bos>
0.75
construct
0.75
build
0.75
creation
0.74
created
0.73
creating
0.72
Activations Density 0.589%