    Explanations

    mentions or instances of the word "confirmation"

    repeated occurrences of the word "confirmation."

    Negative Logits
    "bler"            -0.73
    "psc"             -0.72
    "hner"            -0.72
    "enium"           -0.71
    "ð"               -0.71
    " Icar"           -0.69
    "@#&"             -0.68
    " ILCS"           -0.67
    "sites"           -0.66
    "DIR"             -0.66

    Positive Logits
    "irmation"         1.25
    "atory"            1.04
    "ance"             0.88
    "irming"           0.86
    "ially"            0.84
    " confirmation"    0.84
    "irms"             0.79
    " validity"        0.78
    "irmed"            0.76
    "essions"          0.76

    Act Density 0.031%

    No Known Activations