INDEX
Explanations
ingredients and cooking instructions in recipes
Neuron Alignment
Index   Value   % of L₁
876     +0.11   0.3%
736     +0.11   0.3%
549     +0.11   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
736     +0.11      0.05
1284    +0.11      0.03
1551    +0.11      0.03
Negative Logits
churrasco   -0.94
tortas      -0.91
cammin      -0.90
piña        -0.83
monaster    -0.82
gubern      -0.80
barbacoa    -0.80
pican       -0.80
pandan      -0.79
trion       -0.78
Positive Logits
clayey      0.84
spice       0.81
friable     0.81
spices      0.71
pepper      0.67
flavor      0.64
🌶          0.61
anhyd       0.61
seasoning   0.61
blackish    0.61
Activations Density 0.149%
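
The page does not say how the density figure is computed. One plausible reading is that it is the fraction of tokens on which the feature fires at all. The sketch below works under that assumption. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and do not come from the tool that produced this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens with a nonzero
    activation. The source page does not confirm this definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: the feature fires on 3 of 2000 tokens.
sample = [0.0] * 1997 + [0.4, 1.2, 0.7]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.149% would mean the feature is active on roughly 1 token in 670.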