INDEX
Explanations
phrases related to sharing recipes or food-related content
Neuron Alignment
Index    Value    % of L₁
1595     +0.08    0.2%
478      +0.08    0.2%
748      +0.07    0.2%
Correlated Neurons
Index    P. Corr.    Cos Sim.
1395     +0.08       0.03
1595     +0.08       0.03
748      +0.07       0.01
Negative Logits
wien     -0.93
maer     -0.90
Juf      -0.81
lele     -0.80
parma    -0.79
myn      -0.79
lara     -0.79
inder    -0.79
vespa    -0.78
tempe    -0.78
Positive Logits
disappoint         1.05
disappointment     0.77
disappointed       0.66
disap              0.61
disappointing      0.59
exceeded           0.58
disappointments    0.58
expectations       0.57
失望               0.56
newOwner           0.56
Activations Density: 0.248%