INDEX
    Explanations

    mentions or instances of the word "proof" in various contexts

    instances of the word "proof" and its variations, indicating a focus on verification or evidence

    Negative Logits
    Token            Logit
    " livest"        -0.79
    "artney"         -0.69
    " Osc"           -0.69
    "ideshow"        -0.68
    "lished"         -0.68
    "ufact"          -0.67
    " Peninsula"     -0.66
    "asions"         -0.66
    "iewicz"         -0.66
    " contrace"      -0.65
    Positive Logits
    Token            Logit
    "reading"        1.23
    "reader"         1.07
    "read"           0.95
    "edly"           0.94
    "ificate"        0.88
    " proof"         0.85
    "ing"            0.85
    "ingen"          0.83
    " proofs"        0.82
    "uers"           0.81
    Activation Density: 0.016%

    No Known Activations