INDEX
    Explanations

    phrases and words associated with emotional expressions and moments of awareness

    New Auto-Interp
    Negative Logits
    orthy
    -0.84
    Published
    -0.79
    Guest
    -0.69
    ourses
    -0.65
    atre
    -0.64
    dule
    -0.64
    uthor
    -0.64
    missions
    -0.63
    ufact
    -0.62
    Recommend
    -0.62
    POSITIVE LOGITS
     stare
    0.81
     pesky
    0.80
     hump
    0.78
     thing
    0.78
     alone
    0.77
    milo
    0.72
     smile
    0.71
     itch
    0.70
     popping
    0.67
     grin
    0.67
    Act Density 0.162%

    No Known Activations