INDEX
Explanations
software and technology-related terms and updates
Neuron Alignment
Index   Value   % of L₁
1978    +0.12   0.4%
453     +0.11   0.3%
50      +0.10   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
499     +0.12      0.04
283     +0.11      0.01
1257    +0.10      0.03
Negative Logits
intersper   -1.64
encomp      -1.63
shenan      -1.57
snoopy      -1.53
impra       -1.50
depic       -1.49
disagre     -1.49
increa      -1.48
apprehen    -1.47
unve        -1.46
Positive Logits
utop    1.22
balon   1.13
meras   1.09
ortop   1.02
marte   1.02
kosme   1.00
spion   1.00
tenda   0.99
teras   0.99
elek    0.97
Activations Density 0.159%