INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    _bl
    -0.08
     Dead
    -0.07
    (bl
    -0.07
    _it
    -0.07
     God
    -0.07
    -rock
    -0.07
    ad
    -0.06
    IOD
    -0.06
    How
    -0.06
    	def
    -0.06
    POSITIVE LOGITS
     purchase
    0.13
     Purchase
    0.10
    0.10
     Purch
    0.10
     purchased
    0.10
     purchasing
    0.09
     purchaser
    0.09
    urch
    0.08
    Purchase
    0.08
    0.08
    Act Density 0.012%

    No Known Activations