INDEX
Explanations
mentions of breakfast
references to breakfast
New Auto-Interp
Negative Logits
lest
-0.76
Torn
-0.72
ifer
-0.70
viol
-0.68
yrights
-0.68
Reeves
-0.67
](
-0.66
warranted
-0.65
techn
-0.65
alties
-0.65
POSITIVE LOGITS
breakfast
3.85
Breakfast
2.81
brunch
2.34
lunch
2.26
supper
2.16
dinner
2.13
meals
1.88
pancakes
1.83
meal
1.72
Lunch
1.65
Activations Density 0.014%