INDEX
    Explanations

    factual corrections in news articles

    corrections

    New Auto-Interp
    Negative Logits
     kasarigan
    -0.78
    TintMode
    -0.75
    gameserver
    -0.69
     SwitchCompat
    -0.64
    ValueStyle
    -0.63
    RegressionTest
    -0.59
    CppCodeGen
    -0.59
    ########.
    -0.56
     مرئيه
    -0.53
     vPvB
    -0.52
    POSITIVE LOGITS
    __':
    
    0.71
    ...');
    0.71
     snippetHide
    0.68
    '):
    
    0.67
    >");
    
    0.64
     Efq
    0.64
    ')):
    0.63
    ]');
    0.63
    />";
    0.62
     />";
    0.62
    Act Density 0.434%

    No Known Activations