INDEX
    Explanations

    words associated with congratulations and good luck

    congratulatory messages and expressions of celebration

    New Auto-Interp
    Negative Logits
    improve
    -0.72
    hover
    -0.69
    capacity
    -0.66
    sit
    -0.65
     disadvantage
    -0.65
    ready
    -0.64
    uggest
    -0.63
    isol
    -0.63
    ahon
    -0.63
    itta
    -0.63
    POSITIVE LOGITS
     sir
    0.88
    !!!
    0.88
    !
    0.87
    !!
    0.87
     comrade
    0.87
     everyone
    0.86
     everybody
    0.81
     guys
    0.80
     @
    0.79
    !,
    0.77
    Act Density 0.085%

    No Known Activations