INDEX
    Explanations

    phrases related to deletion or removal actions

    instances of the word "delete" in various forms and contexts

    New Auto-Interp
    Negative Logits
     Serv
    -0.71
     Semin
    -0.68
     Providence
    -0.66
     pitched
    -0.66
     Olymp
    -0.65
     Baker
    -0.64
     Rising
    -0.63
     Compet
    -0.63
     Watts
    -0.63
     benef
    -0.63
    POSITIVE LOGITS
     delete
    3.43
    delete
    2.55
     deleting
    2.46
     delet
    2.32
    Delete
    2.24
     deletion
    2.23
     Delete
    2.17
     erase
    1.75
     overwrite
    1.63
     uninstall
    1.61
    Act Density 0.020%

    No Known Activations