INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     ipc
    -0.08
     Bennett
    -0.07
     complaint
    -0.07
     dbc
    -0.07
    ipc
    -0.07
     Complaint
    -0.07
    Section
    -0.07
     Madame
    -0.07
     Blitz
    -0.07
     elimination
    -0.06
    POSITIVE LOGITS
     grow
    0.18
     growing
    0.16
     grew
    0.14
     grown
    0.12
     Grow
    0.12
     Growing
    0.12
     grows
    0.12
    Grow
    0.12
    grow
    0.11
    Growing
    0.10
    Act Density 0.017%

    No Known Activations