INDEX
    Explanations

    words related to success or achievement

    New Auto-Interp
    Negative Logits
    thing
    -0.16
     seriously
    -0.16
    nell
    -0.14
    v
    -0.14
    liness
    -0.14
    è¾°
    -0.14
    ellas
    -0.14
     Bucc
    -0.13
    xce
    -0.13
    ego
    -0.13
    POSITIVE LOGITS
    ively
    0.29
    ive
    0.26
    antly
    0.20
    iven
    0.19
    ingly
    0.17
    ced
    0.16
    ãĥ³ãĤ¹
    0.16
    ivec
    0.16
    full
    0.15
    SSION
    0.15
    Act Density 0.013%

    No Known Activations