INDEX
Explanations
instructions or steps related to technology or software usage
Neuron Alignment
Index   Value   % of L₁
381     +0.11   0.3%
876     +0.10   0.3%
453     +0.09   0.2%
Correlated Neurons
Index   P. Corr.   Cos Sim.
595     +0.11      0.04
1275    +0.10      0.04
1415    +0.09      0.03
Negative Logits
Michoacán     -0.73
Áng           -0.65
Meksi         -0.64
Almería       -0.64
Cádiz         -0.64
silikon       -0.62
ekos          -0.59
maksi         -0.57
Guanajuato    -0.56
Lajos         -0.55
Positive Logits
impra     0.92
embodi    0.80
purcha    0.78
squa      0.76
coö       0.74
increa    0.73
seiz      0.72
scrat     0.70
resear    0.70
strick    0.70
Activations Density 0.222%