INDEX
    Explanations

    mentions of corrections or updates in articles

    Negative Logits
    akuya     -0.61
    stones    -0.59
    omorph    -0.58
    ometers   -0.56
    goers     -0.55
    vs        -0.54
    ibles     -0.54
    roma      -0.53
    ologne    -0.53
    mods      -0.53
    Positive Logits
     Updated    0.64
     headline   0.62
     reporting  0.60
     dispatch   0.59
    Published   0.59
     REPORT     0.56
     WARN       0.55
     HuffPost   0.55
     article    0.55
     typo       0.55
    Activation Density 8.186%

    No Known Activations