INDEX
Explanations
descriptions related to app features and possible improvements
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
876
+0.15
0.5%
1446
+0.12
0.4%
678
+0.11
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
2044
+0.15
0.07
1415
+0.12
0.03
1446
+0.11
0.03
Negative Logits
Khart
-1.31
Bartholo
-1.24
Keny
-1.23
Juf
-1.18
Glou
-1.15
Hez
-1.11
Shakspeare
-1.11
Gorb
-1.11
Abbé
-1.10
Mahomet
-1.08
POSITIVE LOGITS
user
0.74
usability
0.66
users
0.66
kwi
0.63
frustra
0.62
fiets
0.61
functionality
0.60
ergonomic
0.60
bewah
0.59
user
0.59
Activations Density 0.535%