INDEX
    Explanations

    Lan Cao, XML, sole legal, folder

    New Auto-Interp
    Negative Logits
    !\
    0.48
    !
    0.47
    io
    0.44
     Dark
    0.44
     post
    0.43
    .\
    0.43
     Read
    0.43
    ,\
    0.42
     Amazon
    0.42
     for
    0.42
    POSITIVE LOGITS
    ↵↵↵↵↵↵↵↵
    0.67
    ↵↵↵↵↵↵
    0.64
    ↵↵↵↵
    0.62
    ↵↵↵↵↵↵↵↵↵↵
    0.59
    ↵↵↵↵↵↵↵
    0.57
    ↵↵↵↵↵↵↵↵↵↵↵↵↵↵
    0.56
    ↵↵↵↵↵↵↵↵↵↵↵↵
    0.55
    ↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵
    0.53
    ↵↵↵↵↵↵↵↵↵↵↵↵↵
    0.52
    ↵↵↵↵↵↵↵↵↵
    0.51
    Act Density 0.000%

    No Known Activations