INDEX
Explanations
technical issues reported in a QA or testing context
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
876
+0.19
0.6%
344
+0.10
0.3%
1403
+0.09
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
2044
+0.19
0.06
1415
+0.10
0.02
194
+0.09
0.01
Negative Logits
alkoh
-0.82
Milán
-0.80
kalori
-0.79
seksi
-0.78
Meksiko
-0.75
Cristóbal
-0.70
Meksi
-0.70
Libros
-0.69
Palestina
-0.69
organik
-0.68
POSITIVE LOGITS
troubleshooting
1.01
troubleshoot
0.99
exasper
0.90
Troubleshooting
0.79
solvable
0.77
diagnosing
0.76
infuriating
0.73
disreg
0.72
remedied
0.70
ennemi
0.70
Activations Density 0.617%