    Negative Logits
    " sank"       -0.07
    "/GL"         -0.07
    "	Object"     -0.07
    ".getY"       -0.07
    " Era"        -0.07
    " dumb"       -0.07
    "publish"     -0.07
    "13"          -0.07
    "Kernel"      -0.07
    "Stop"        -0.06
    Positive Logits
    " choice"     0.13
    " choices"    0.09
    " bliss"      0.07
    " Choice"     0.07
    (blank)       0.07
    " бла"        0.07
    " recourse"   0.07
    "alie"        0.07
    "choices"     0.07
    "-choice"     0.07
    Activation Density: 0.016%

    No Known Activations