INDEX
Explanations
keywords or phrases related to dining experiences
Punctuation followed by capitalized words
bonus offers
New Auto-Interp
Negative Logits
]--;
-0.73
$",
-0.65
)";
-0.63
()",
-0.62
TestBed
-0.61
)");
-0.60
]`
-0.59
)',
-0.59
)",
-0.59
}\]
-0.58
POSITIVE LOGITS
Plus
0.81
Oh
0.79
Plus
0.78
Bonus
0.77
Oh
0.76
Bonus
0.75
BONUS
0.74
oh
0.70
bonus
0.68
bonus
0.68
Activations Density 0.184%