INDEX
    Explanations

    words related to achieving success or making progress

    gerunds and present participles related to actions

    New Auto-Interp
    Negative Logits
     wa
    -0.62
     Bei
    -0.62
     Bai
    -0.59
    rium
    -0.57
     apples
    -0.57
     apple
    -0.57
    \<
    -0.56
    aple
    -0.56
    OTUS
    -0.56
     Toby
    -0.56
    POSITIVE LOGITS
    kefeller
    0.99
    restling
    0.89
    backer
    0.87
    issance
    0.79
    IGH
    0.74
     Racer
    0.73
    itual
    0.73
    PAC
    0.72
    esh
    0.72
    iking
    0.71
    Act Density 0.057%

    No Known Activations