INDEX
Explanations
descriptions of baking recipes and ingredients
Neuron Alignment
Index   Value   % of L₁
876     +0.23   0.8%
736     +0.13   0.4%
519     +0.12   0.4%
Correlated Neurons
Index   P. Corr.   Cos Sim.
736     +0.23      0.07
519     +0.13      0.04
876     +0.12      -0.01
Negative Logits
Jurist        -0.95
monaster      -0.83
Hauptmann     -0.75
kemer         -0.75
Rektor        -0.72
cittad        -0.72
Kriminal      -0.71
Bundestag     -0.69
Presidencia   -0.69
Conferencia   -0.68
Positive Logits
ftu        1.44
increa     1.33
fta        1.31
purcha     1.28
guarante   1.28
affor      1.25
embodi     1.25
poff       1.23
disagre    1.21
ftre       1.19
Activations Density: 0.356%