    NEGATIVE LOGITS
     revisions            -0.07
    bbb                   -0.07
     originals            -0.07
    	args                 -0.06
     slices               -0.06
     ComponentFixture     -0.06
    átel                  -0.06
     slice                -0.06
     sui                  -0.06
     Mars                 -0.06
    POSITIVE LOGITS
     do                   0.13
     Do                   0.10
    "Do                   0.10
    Do                    0.10
    	do                   0.09
    /do                   0.08
    do                    0.08
     doing                0.08
     DO                   0.08
    -do                   0.08
    Activation Density: 0.148%

    No Known Activations