INDEX
Explanations
ingredients and recipes in a food-related context
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
764
+0.11
0.3%
674
+0.09
0.3%
801
+0.08
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
801
+0.11
0.03
620
+0.09
0.03
609
+0.08
0.02
Negative Logits
pamph
-0.78
shenan
-0.74
intersper
-0.73
unspeak
-0.69
frivol
-0.68
celtic
-0.68
kraken
-0.66
vainly
-0.66
cuck
-0.66
intrigu
-0.66
POSITIVE LOGITS
NSCoder
0.62
RenderAtEndOf
0.62
cotone
0.61
lijah
0.60
habet
0.58
XmlAccessorType
0.55
boldmath
0.54
Портали
0.53
etiam
0.53
gomma
0.52
Activations Density 0.083%