INDEX
    Explanations

    references to editing or modifications in online content

    instances of the word "Edit" in the text

    Negative Logits
     baskets        -0.70
     heck           -0.70
     clutch         -0.67
     buckets        -0.65
     basket         -0.63
     hoops          -0.62
     rightfully     -0.62
    ibaba           -0.58
     thunder        -0.58
    Harris          -0.57
    Positive Logits
     Edit           4.06
    edit            2.05
     edit           1.88
    Edit            1.81
    ".[             1.38
    ).[             1.28
    .[              1.26
    :[              1.24
    Contents        1.22
    ¶               1.18
    Activation Density 0.018%

    No Known Activations