INDEX
Explanations
emotional reactions expressed through exclamations or rhetorical questions
New Auto-Interp
Negative Logits
vale
-0.90
ourse
-0.87
etheless
-0.83
guiActiveUn
-0.77
isSpecialOrderable
-0.74
eatures
-0.71
staking
-0.70
interstitial
-0.70
DragonMagazine
-0.69
inction
-0.68
POSITIVE LOGITS
please
0.96
oh
0.91
yeah
0.91
why
0.90
huh
0.85
hurry
0.81
maybe
0.79
let
0.78
WHY
0.77
WHAT
0.74
Activations Density 0.040%