INDEX
    Explanations

    multiple choice options

    New Auto-Interp
    Negative Logits
    0.95
    waitForIdleSync
    0.93
    0.92
    0.91
    0.90
    0.85
    0.84
    attiyam
    0.84
    0.84
    0.84
    POSITIVE LOGITS
    1.00
     
    0.91
    0.89
    0.84
    0.83
    0.83
    0.82
     B
    0.82
    0.80
    0.80
    Act Density 0.000%

    No Known Activations