INDEX
Explanations
references to meals and dining experiences
New Auto-Interp
Negative Logits
Prem
-0.15
alle
-0.15
Mood
-0.14
ermann
-0.14
.proxy
-0.14
Expl
-0.13
avana
-0.13
浦
-0.13
Material
-0.13
alle
-0.13
POSITIVE LOGITS
lunch
0.43
dinner
0.39
meals
0.36
breakfast
0.35
Lunch
0.34
meal
0.31
Dinner
0.30
lunches
0.28
Breakfast
0.27
supper
0.24
Activations Density 0.217%