INDEX
Explanations
terms related to server selection and configuration in a technical context
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
67
+0.15
0.6%
1624
+0.13
0.5%
1937
+0.12
0.5%
Correlated Neurons
Index
P. Corr.
Cos Sim.
67
+0.15
0.03
1624
+0.13
0.02
1124
+0.12
0.02
Negative Logits
roth
-0.59
underta
-0.58
magi
-0.54
Wynne
-0.54
gaily
-0.54
vry
-0.53
disgra
-0.52
Sequo
-0.51
apprehen
-0.50
greate
-0.50
POSITIVE LOGITS
server
1.45
Server
1.36
server
1.36
servers
1.28
Server
1.25
Servers
1.14
SERVER
1.12
SERVER
1.07
servers
1.02
Servers
0.99
Activations Density 0.081%