INDEX
    Explanations

    phrases related to tasks being completed or finished

    instances of the word "done."

    Negative Logits
    "š"             -0.70
    "anth"          -0.65
    "lement"        -0.62
    "unin"          -0.61
    "anta"          -0.59
    " McCorm"       -0.58
    " correspond"   -0.56
    " Ann"          -0.56
    "acas"          -0.54
    "erald"         -0.53
    Positive Logits
    " done"           3.60
    "done"            2.82
    " Done"           2.29
    "Done"            2.08
    " accomplished"   1.77
    " undertaken"     1.72
    " performed"      1.68
    " finished"       1.39
    " completed"      1.38
    " achieved"       1.34
    Act Density 0.021%
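
The density figure above is most naturally read as the fraction of token positions on which the feature fires. The sketch below illustrates that reading only. The function name `activation_density`, the zero threshold, and the sample data are all assumptions made for illustration, not taken from the dashboard or its tooling.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: a feature counts as "active" on a token when its activation
    is strictly greater than the threshold (0.0 here).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: the feature fires on 21 of 100,000 tokens.
sample = [1.5] * 21 + [0.0] * 99_979
density = activation_density(sample)
print(f"{density:.3%}")  # 0.021%
```

Under this reading, a density of 0.021% corresponds to roughly 21 active tokens per 100,000.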

    No Known Activations