INDEX
    Explanations

    emotional reactions expressed through exclamations or rhetorical questions

    New Auto-Interp
    Negative Logits
    vale
    -0.90
    ourse
    -0.87
    etheless
    -0.83
     guiActiveUn
    -0.77
    isSpecialOrderable
    -0.74
    eatures
    -0.71
    staking
    -0.70
    interstitial
    -0.70
    DragonMagazine
    -0.69
    inction
    -0.68
    POSITIVE LOGITS
     please
    0.96
     oh
    0.91
     yeah
    0.91
     why
    0.90
     huh
    0.85
     hurry
    0.81
     maybe
    0.79
     let
    0.78
     WHY
    0.77
     WHAT
    0.74
    Act Density 0.040%

    No Known Activations