INDEX
Explanations
recipes or cooking instructions
Neuron Alignment
Index    Value    % of L₁
876      +0.13    0.4%
1013     +0.12    0.4%
736      +0.09    0.3%
Correlated Neurons
Index    P. Corr.    Cos Sim.
736      +0.13       0.05
753      +0.12       0.01
1013     +0.09       0.04
Negative Logits
pamph        -1.35
intersper    -1.06
Darío        -1.05
Muhamma      -0.99
reluct       -0.99
Keny         -0.98
Abbé         -0.96
Simult       -0.95
shenan       -0.95
Bartholo     -0.94
Positive Logits
storage         0.76
stored          0.73
storage         0.69
refrigerator    0.68
Storage         0.66
storing         0.65
shelf           0.65
fridge          0.64
Storage         0.61
stored          0.61
Activations Density 0.253%