INDEX
    Explanations
    NEGATIVE LOGITS
    Token               Logit
    "-framework"        -0.07
    ".scenes"           -0.07
    " Ark"              -0.07
    " Marty"            -0.07
    " B"                -0.06
    "TRGL"              -0.06
    " Kg"               -0.06
    " sklearn"          -0.06
    (token not shown)   -0.06
    " идет"             -0.06
    POSITIVE LOGITS
    Token               Logit
    " copper"           0.08
    " Copper"           0.08
    " paper"            0.07
    " Liam"             0.06
    " punch"            0.06
    " COMMAND"          0.06
    " Cooper"           0.06
    " Fellow"           0.06
    "kish"              0.06
    "Fortunately"       0.06
    Activation Density: 0.011%

    No Known Activations