INDEX
    Explanations

    corrections or editorial notes within text

    instances of the word "correction" and its variations

    New Auto-Interp
    Negative Logits
    atos
    -0.94
    bang
    -0.76
    axy
    -0.72
    ramid
    -0.69
    ulhu
    -0.68
    gha
    -0.67
    aden
    -0.65
    WAYS
    -0.65
    ahan
    -0.64
    enf
    -0.64
    POSITIVE LOGITS
     inaccur
    0.77
     leveled
    0.76
    Correction
    0.74
    Reviewed
    0.71
     Corrections
    0.69
    ettings
    0.68
     Correction
    0.66
     empt
    0.66
    Correct
    0.66
     inaccurate
    0.65
    Act Density 0.027%

    No Known Activations