INDEX
Explanations
references to desserts or sweet treats, particularly cakes
references to cake in various contexts
New Auto-Interp
Negative Logits
nesota
-0.81
ually
-0.74
ENTION
-0.73
iveness
-0.71
ostics
-0.70
Lomb
-0.67
OSE
-0.66
Cheong
-0.64
Fargo
-0.63
selves
-0.62
POSITIVE LOGITS
cake
0.90
cakes
0.89
meal
0.89
batter
0.88
cake
0.88
walk
0.86
pillar
0.83
hop
0.80
cakes
0.80
decor
0.77
Activations Density 0.024%