INDEX
    Explanations

    terms related to correctness and incorrectness

    "Correct" and "wrong" labels or predictions

    correct or wrong assessments

    Negative Logits
    " AssemblyCompany"    -0.88
    "tagHelperRunner"     -0.79
    " ostavi"             -0.75
    " newBuilder"         -0.74
    " المعيارى"           -0.71
    "InitVars"            -0.69
    "+#+#"                -0.69
    "awaiter"             -0.69
    "InjectAttribute"     -0.68
    " كومونز"             -0.68

    Positive Logits
    " Wrong"              0.89
    "Wrong"               0.78
    " wrong"              0.78
    " 错误"               0.76
    " Incorrect"          0.76
    " WRONG"              0.76
    " wrongs"             0.74
    "WRONG"               0.74
    " incorrect"          0.72
    "wrong"               0.72

    Activation Density: 0.200%

    No Known Activations