INDEX
Explanations
proper nouns related to a story or narrative
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1350
+0.17
0.7%
370
+0.16
0.6%
596
+0.14
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
370
+0.17
0.06
981
+0.16
0.06
1350
+0.14
0.05
Negative Logits
ElementRef
-0.52
Michel
-0.49
McNeil
-0.48
Henderson
-0.48
Albert
-0.47
Afrique
-0.47
dropIfExists
-0.47
Vince
-0.47
Rutland
-0.46
Schmid
-0.46
POSITIVE LOGITS
javier
1.23
leonardo
1.19
eduardo
1.17
sergio
1.16
fernando
1.15
roberto
1.14
ricardo
1.11
jorge
1.10
lorenzo
1.09
wikihow
1.09
Activations Density 0.369%