INDEX
Explanations
scripts or writing instructions related to creating something
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1677
+0.16
0.6%
1757
+0.15
0.6%
645
+0.13
0.5%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1677
+0.16
0.03
1637
+0.15
0.02
893
+0.13
0.02
Negative Logits
intit
-0.52
radeon
-0.51
dentes
-0.50
Fortun
-0.48
Persson
-0.48
Adnan
-0.48
disting
-0.48
accla
-0.47
betra
-0.46
Balk
-0.46
POSITIVE LOGITS
script
1.54
scripts
1.46
script
1.36
Script
1.36
Script
1.28
SCRIPT
1.23
Scripts
1.23
SCRIPT
1.20
scripting
1.20
scripts
1.12
Activations Density 0.086%