INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    eil
    -0.07
    jobs
    -0.07
     footprint
    -0.07
    bean
    -0.06
    bing
    -0.06
    -0.06
    /gif
    -0.06
    ounder
    -0.06
    boo
    -0.06
     Bhar
    -0.06
    POSITIVE LOGITS
    0.07
    250
    0.06
    .Acc
    0.06
     Mann
    0.06
    ?("
    0.06
    0.06
     manten
    0.06
     lawy
    0.06
     mak
    0.06
     FIR
    0.06
    Act Density 0.007%

    No Known Activations