INDEX
    Explanations

    phrases related to completing tasks or projects

    phrases related to strong emotional expressions or experiences

    New Auto-Interp
    Negative Logits
     Vaugh
    -0.67
    enegger
    -0.63
     helicop
    -0.60
     Moroc
    -0.56
    oppable
    -0.56
    bledon
    -0.56
     disadvant
    -0.55
     Frie
    -0.52
     incorpor
    -0.50
     Niet
    -0.50
    POSITIVE LOGITS
    \":
    0.63
    hs
    0.47
    photos
    0.45
     splash
    0.44
    lder
    0.44
     Halo
    0.43
     teasing
    0.42
    ':
    0.41
    ¶
    0.41
    rots
    0.41
    Act Density 2.165%

    No Known Activations