INDEX
    Explanations
    NEGATIVE LOGITS
    Token           Logit
    " Luke"         -0.07
    " quadrant"     -0.06
    "orange"        -0.06
    "\tCamera"      -0.06
    " impact"       -0.06
    "places"        -0.06
    " Huang"        -0.06
    " Slo"          -0.06
    " flames"       -0.06
    " zaměř"        -0.06
    POSITIVE LOGITS
    Token           Logit
    " petition"     0.14
    " petitions"    0.13
    " Peterson"     0.07
    " Poll"         0.07
    " petitioner"   0.07
    " GT"           0.07
    "toi"           0.07
    "Pet"           0.07
    " myslí"        0.07
    "etxt"          0.07
    Activation Density: 0.007%

    No Known Activations