INDEX
    Explanations

    phrases related to proving or demonstrating something

    elements related to proof concepts and evidence

    New Auto-Interp
    Negative Logits
    lance
    -0.74
    adelphia
    -0.72
    seek
    -0.70
    scape
    -0.65
    rouse
    -0.64
    bleacher
    -0.64
    erest
    -0.63
    otom
    -0.63
    icularly
    -0.62
     Braun
    -0.61
    POSITIVE LOGITS
     allegiance
    0.78
     existence
    0.75
     nonex
    0.74
     equival
    0.73
     authenticity
    0.70
    concept
    0.70
     innocence
    0.68
     correctness
    0.67
    empt
    0.66
     cancellation
    0.64
    Act Density 0.050%

    No Known Activations